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Preface 


This book is about the general theory of relativity which is concisely labeled as general relativity. The book 
is based in part on personal notes and tutorials about topics and applications related to modern physics 
and tensor calculus as well as many insights and analyses that were acquired during my academic and 
research activities. During the writing of this book, I read and consulted many references (i.e. textbooks 
and scientific papers) on this subject; the main of these references are listed in the References section 
in the back of this book. I also read and consulted many general articles, comments, notes, discussions, 
controversies, etc. which I mostly found on the Internet. The book, therefore, is the result of a rather 
extensive view to the literature of this theory over most of its lifetime reflecting various stages of its 
development. 

The book contains many solved problems (129) as well as many exercises (606) whose detailed solutions 
are published in another book that accompanies the present book. The book also includes a detailed 
index and many cross references (which are hyperlinked in the digital versions) to facilitate the connection 
between related subjects and parts and hence improve understanding and recollection. The book can be 
used as an introduction to general relativity at undergraduate and graduate levels. 

Unlike most other books on general relativity which are mostly dedicated to the presentation, justifi- 
cation, application and validation of the formalism of the theory (and hence rather minor attention is 
usually paid to the interpretation and epistemology of the theory), this book is primarily interested in 
the interpretative and epistemological aspects of the theory. This is obviously inline with the assessment 
objective of this book, as declared by “Assessed” in the title. 

I should announce that unlike almost all other books on general relativity this book does not include 
any investigation to relativistic cosmology (and indeed to any other application of general relativity like 
relativistic optics) since it is entirely about the theory of general relativity itself and not about any of 
its applications and branches. This is partly due to the limitation on the size and scope of the book 
and partly due to the non-scientific nature of relativistic cosmology in general since it is a topic that is 
overwhelmingly based on mathematical deliberations and philosophical reflections with very little physical 
substance or evidence. 

I should draw the attention of the readers (and potential readers) of this book that “Simplified” in the 
title dose not mean “simple”. The reader of this book must have a strong background in physics and 
general mathematics and should be familiar with the basic concepts, notations and techniques of tensor 
calculus, differential geometry and special relativity. So, it is “Simplified” for the proper reader and not 
for every reader. Therefore, I strongly advise against acquiring or reading this book by readers who do 
not have such a suitable background to avoid frustration and disappointment. 

Finally, I should announce that it is a great advantage for the reader of the present book to be familiar 
with my previous book “The Mechanics of Lorentz Transformations’ because many topics, discussions, 
arguments, concepts, notations and techniques in the present book are based on what I established in that 
book. The familiarity with my other books on tensor calculus and differential geometry (all of which are 
available on Amazon) should also help the reader to understand and appreciate the contents of this book. 
Taha Sochi 
London, May 2020 
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Nomenclature 


In the following list, we define the common symbols, notations and abbreviations which are used in the 
book as a quick reference for the reader. The list generally excludes what is used locally and casually. 


V nabla differential operator 
Vf gradient of scalar f 

V-A divergence of vector A 
VxA curl of vector A 

Vv? Laplacian operator 


nabla differential 4-operator 


2 d’Alembertian 4-operator 
, (subscript) partial derivative with respect to the following index(es) 
; (subscript) covariant derivative with respect to the following index(es) 
nD n-dimensional 
5/5, 67/52 absolute derivative operators with respect to A 
0;, O° partial differential operators with respect to i” coordinate 
[i7, k], (pv, w] Christoffel symbol of the 1%’ kind in 3D (or nD) space, in 4D spacetime 
a mean distance (in planetary motion) 
a semi-major axis of ellipse or transverse semi-axis of hyperbola 
a acceleration vector in 3D space 
A area 
A constant in the orbital equations 
a, A, p” parameters in Kerr metric 
b semi-minor axis of ellipse or conjugate semi-axis of hyperbola 
B constant representing magnitude of angular momentum per mass or twice areal speed 
B2, B3, B4 author books (see References) 
B2X, B3X, B4X author exercise books (see References) 
c the characteristic speed of light in vacuum (= 299792458 m/s) 
ct, 7,0, Schwarzschild coordinates, Boyer-Lindquist coordinates 
Cs constant equal to Schwarzschild radius 
d distance 
dr differential of position vector 
ds, do infinitesimal line element in 3D (or nD) space, in 4D spacetime 
D, D# displacement (or deviation) 4-vector 
diag [---] diagonal matrix with the embraced diagonal elements -- - 
e€ eccentricity of ellipse 
€;, €9,€¢ basis vectors of normalized spherical coordinate system 
E, Eo energy, rest energy 
E;, E’ i*” covariant and contravariant basis vectors 
Eq., Eqs. Equation, Equations 
ESA European Space Agency 


f magnitude of force in 3D space 

F dimensionless factor in Schwarzschild metric (= 1 — [Rgs/r]) 

f force vector in 3D space 

ti tidal force 

g magnitude of gravitational acceleration or gravitational force field 
g determinant of covariant metric tensor 

g gravitational acceleration or gravitational force field 

G the gravitational constant (~ 6.674 x 1071! N m?/kg”) 

G trace of Einstein tensor 


Suv, haus foley 
SAO 


(superscript) 


BARNA 
4 
3 


erste ye 


Einstein tensor 

metric tensor of 3D (or nD) space 

metric tensor of 4D spacetime 

Global Positioning System 

Planck constant (~ 6.62607 x 1074 J s) 
reduced Planck constant (= h/27) 

functions of r in Schwarzschild metric 

if and only if 

magnitude of angular momentum 

Boltzmann constant (~ 1.38065 x 10773 J/K) 
length 

LAser GEOdynamics Satellites 

Laser Interferometer Gravitational-wave Observatory 
Laser Interferometer Space Antenna 

mass 

gravitational mass, inertial mass 

number of dimensions of space 

number of particles inside a given volume 
number of components or elements 

number density, proper number density 

number of planetary revolutions per century 
normal unit vector to surface 

National Aeronautics and Space Administration 
momentum vector in 3D space, in 4D spacetime 
electric charge 

radius, radial distance, radial coordinate 
position vector |e.g. (x,y, z) or (a, 2, 2°) or (a1,--- 
perihelion distance, aphelion distance 

plane polar coordinates 

spherical coordinates of 3D space 

radius, radial distance 

Ricci curvature scalar 

Ricci curvature tensor of the 1% and 2” kind in 3D (or nD) space 

Ricci curvature tensor of the 1° and 2”¢ kind in 4D spacetime 
Riemann-Christoffel curvature tensor of the 1°* and 24 kind in 3D (or nD) space 
Riemann-Christoffel curvature tensor of the 1° and 2”¢ kind in 4D spacetime 
Schwarzschild radius 

space interval (or length of arc) 

electromagnetic field strength tensor 

Smithsonian Astrophysical Observatory 

time, temporal coordinate 

periodic time 

temperature 

trace of energy-momentum tensor 

transposition 

time of a century 

unit tangent vector 

energy-momentum tensor 

1/r 

speed 

geodesic unit vector 

velocity vector in 3D space 


,v”)| 


velocity vector in 4D spacetime 

speed 

velocity vector in 3D space 

volume 

escape speed 

general relativistic effective potential, Newtonian effective potential 
Very Long Baseline Interferometry 

position vector in 4D spacetime, ie. (x°, 21, x, x3) 

i#’ Cartesian coordinate (or covariant form of general coordinates) 
general coordinates of 3D (or nD) space, of 4D spacetime 
coordinates of 3D space (usually orthonormal Cartesian) 
angle of precession per revolution (in geodetic effect) 
Lorentz factor 

Christoffel symbol of the 2”¢ kind in 3D (or nD) space, in 4D spacetime 
Kronecker delta tensor in 3D (or nD) space 

Kronecker delta tensor in 4D spacetime 

angle of deflection of light 

extra precession of perihelion per revolution 

extra precession of perihelion per century 

finite change 

trace of perturbation tensor 

the permittivity of free space 

perturbation tensor 

Minkowski metric tensor 

coefficient of the Field Equation (= —87Gc~*) 

geodesic and normal curvatures 

curvature vector 

geodesic and normal components of curvature vector 
natural (or affine) parameter of curve 

wavelength 

cosmological constant 

the permeability of free space 

frequency 

density, proper density 

cylindrical coordinates of 3D space 

spacetime interval 

Stefan-Boltzmann constant (~ 5.67037 x 10-8 Wm~? K~*) 
proper time 

gravitational potential 

perturbation tensor 

angular speed 


Chapter 1 
Preliminaries 


In this introductory chapter we present some preliminaries and background materials related mainly to 
gravity and general relativity. We also discuss general notations, conventions, symbols and terminology 
that we use in this book. A brief investigation of some general scientific aspects that will be used later in 
the book is also included. 


1.1 Synopsis about Gravity and General Relativity 


In simple terms, gravity is an attractive force that acts between two massive bodies. Although it is the 
weakest of all known forces, it is the most dominant and influential force in the Universe as it is the main 
factor that governs the physics of the Universe on large scale. The exceptional importance and dominance 
of gravity arise from its universality and long range since it is a property of matter regardless of its 
form and shape (unlike electric force for instance which is restricted to charged matter) and regardless 
of range (unlike strong force for instance which is restricted to short range). For example, gravity is the 
primary factor in the formation of groups, clusters, galaxies and stars. It is also responsible for the orbital 
motion of the celestial bodies such as the revolution of the planets in the solar system around the Sun. 
Therefore, general relativity as a gravity theory finds its natural applications in astronomy, astrophysics 
and cosmology. 

We may note the following characteristics about gravity:l!] 
e As indicated above, gravity is an attractive force and hence there is no repulsive gravitational force 
unlike electric force for instance which is attractive between unlike charges and repulsive between like 
charges. However, we should note that the subject of this remark is ordinary matter although there are 
indications that anti-matter also follows the same gravitational rules as ordinary matter and hence the 
gravitational force that involves anti-matter (i.e. if one or both massive objects are made of anti-matter) 
is also attractive. 
e Gravity is an attribute of mass (compared for instance to electric force which is an attribute of charge). 
However, since mass is a general attribute of matter, unlike some other restricted attributes of matter like 
charge, gravity can be regarded as an attribute of matter. 
e Gravity is a long range force and hence it dominates the physics of the Universe on large scale where 
the effects of other forces diminish. This dominance should also be attributed in part to its dependency 
on matter regardless of any restricted attribute of matter such as charge (see exercise 1). 


Regarding general relativity, it is essentially a theory of gravity!?] that can be characterized by the 
following: 
e It is a geometric theory of gravity and hence from a general relativistic perspective gravity is not a 
physical force but it is an effect of the distortion of the spacetime by the presence of matter and energy. 
e The fundamental physical pillar of general relativity is the equivalence principle (see § 1.8.2) which 
essentially abolishes the difference between acceleration and gravitation, while its fundamental mathe- 
matical pillar is differential geometry and tensor calculus where the theory heavily relies in its formalism 
on the concepts, language, notation and techniques of these branches of mathematics. More specifically, 
the role of differential geometry is mainly in conceptualizing and formalizing the geometric description 


(1 The following discussion about gravity and general relativity is for the purpose of setting the scene and hence it can be 
loose in some parts. However, most of the discussed issues will be revisited in the future in a more formal and rigorous 
way. 

[2] As we will see, general relativity has also non-gravitational generalizations and consequences which we refer to as “General 
Theory”. 


1.1 Synopsis about Gravity and General Relativity 10 


of multi-dimensional spaces, while the role of tensor calculus is to provide efficient and compact mathe- 
matical notations and techniques for the description of multi-dimensional spaces and their attributes and 
accessories in an invariant way. 

e General relativity has emerged to address certain limitations of the classical Newtonian theory of gravity 
(see § 1.5). Apart from its direct and common applications that it shares with classical gravity, the theory 
currently plays an important role in cosmological and astrophysical studies and provides a theoretical 
framework and motivation for these fields of investigation and research. 

e The essence of general relativity is that the presence and movement of matter and energy cause distortion 
in the surrounding spacetime and hence the spacetime in their vicinity becomes curved. Physical objects 
(like massive particles and light) under the effect of gravity will then follow geodesic pathsl*! in this 
curved spacetime. Accordingly, objects deviate from these geodesic paths only if forces (i.e. other than 
gravity which is not a force according to general relativity) do exist. So, if gravity is the only “force” that 
influences the objects then the objects will be in a state of free fall moving along geodesic trajectories (or 
world lines). 

e Roughly speaking, the effects predicted by general relativity that distinguish it from classical gravity 
are only significant and detectable in the strong gravitational fields!4! due to the presence of very large 
or/and very compact aggregates of matter such as black holes and neutron stars where the departure of 
the predictions of general relativity from the predictions of the classical theory of gravity is significant 
and observable. 

e The curvature of spacetime caused by the presence of matter!®! depends on the amount and density of 
matter in the spacetime and hence the curvature increases as the amount and density of matter increase. 
Accordingly, in the vicinity of small and sparse aggregates of matter the spacetime is minimally curved 
and hence it is virtually flat while in the vicinity of large and dense aggregates of matter the spacetime is 
significantly curved. 

e The curvature of spacetime also depends on the distance from the matter that causes this curvature 
and hence the curvature of spacetime diminishes as the distance increases. So, the curvature caused even 
by very large and dense aggregates of matter becomes negligible very far away from these aggregates and 
hence the spacetime is almost flat in such regions of space.|¢ 

e General relativity is a highly mathematical theory since it heavily relies on differential geometry and 
tensor calculus, as indicated earlier. On the observational and experimental side, general relativity largely 
depends in its verification and support on astronomical observations and related applications although it 
has also been endorsed by Earth-based and space-based experimental evidence according to its supporters 
(refer to § 9 for details). 


Problems 


1. What are the main types of force in physics? 

Answer: According to modern physics there are four types of fundamental forces (or interactions): 
gravitational, electromagnetic, weak and strong. We note that gravity may not be considered techni- 
cally a force according to some modern theories (including general relativity) although the term “force” 
may still be attached to gravity following the common notion and convention. This should also apply 
to other terms of classical gravity which are based on the paradigm of force (such as gravitational field 
and potential) where the use of these classical terms is generally tolerated in the literature of general 
relativity although they are technically incompatible with the theoretical framework and paradigms of 
general relativity. 


[3] Geodesic path or curve in a given space is the most straight path connecting two points in that space (see § 2.9). 

[4] Some exceptions like perihelion precession of Mercury (see § 8.1) may be considered. 

[5] Due to the mass-energy equivalence according to the relation E = mc?, matter may include energy as well as mass. It 
is noteworthy that there is a convention (mainly in cosmology) that massless objects are called radiation while massive 
objects are called matter. Also, there are similar other conventions (e.g. in particle physics and relativistic quantum field 
theories) about these terms. However, we do not strictly follow these conventions. More clarifications about these issues 
will follow. 

[6] In fact, this seems logical if we combine classical gravity (where gravity vanishes far away from the source of gravity due 
to the inverse square relation between gravitational field and distance) with the general relativistic idea that gravity is 
demonstrated by the curvature of spacetime and hence weak/strong gravity means weak/strong curvature and vice versa. 
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2. Why is gravitational force always attractive while electric force can be attractive and repulsive? 
Answer: This may be justified by having only one type of mass but two types of charge. However, 
the attraction (rather than repulsion) between masses still requires explanation. 

3. Is there any way to nullify gravity? 

Answer: There is no way to nullify gravity (or shield it off) in a real sense, i.e. turn it off so it ceases 
to exist. However, the effect of this force can be neutralized to a certain extent by free fall where the 
attractive force of gravity is not felt although it is still present as it is the cause of the fall.!7] In fact, 
the effect of gravity in free fall can be observed by an outside observer, i.e. an observer who is not 
in the frame of free fall. For example, the planets in their orbits around the Sun and the artificial 
satellites in their orbits around the Earth are in a state of free fall but the effect of gravity on them 
can be observed by an outside observer who observes their revolution due to gravity. 

4. What is the colloquial idiom that is commonly used to express the essence of general relativity? 
Answer: It is “matter tells spacetime how to curve, spacetime tells matter how to move”. 

5. Mention a factor (other than those given in the text) that distinguishes gravity from other forces and 
demonstrates its exceptional nature. 

Answer: It is the ability of gravity to provide some of the most efficient and powerful energy release 
mechanisms through the accretion of matter onto compact objects (see § 8.7). 


Exercises 


1. Explain why gravity, unlike electric force, is the dominant force on large scale although they have 
identical dependency on distance since both are subject to the same inverse square relation (classically 
at least). 
Compare mass and electric charge in their relation to gravity and electromagnetism. 
Compare, briefly, between gravity in classical physics and gravity in general relativity. 
Outline the need for general relativity and the factors that contributed to the emergence of this theory. 
According to general relativity, what is the characteristic feature of the world line followed by an object 
under the influence of gravity alone (and hence it is in a state of free fall)? Consider in your answer 
massless objects (such as light) as well as massive objects. 
6. Establish a connection between special relativity and general relativity with regard to the underlying 
spaces of these theories and the motion of physical objects in these spaces. 
7. Outline the main criticisms that faced Newtonian gravity and contributed to the emergence of general 
relativity. 
8. Briefly discuss the implications of the fact that general relativity is a geometric theory. 
9. What are the main pillars of general relativity? 
10. Explain the need for tensor calculus and differential geometry in the formulation of general relativity. 
11. In general relativity, what is the link between the doctrine of representing gravity by the curvature of 
spacetime and the redundancy of the concept of force (as applied to gravity)? 


OL Seu 


1.2 Historical Issues and Credits 


The tendency of massive objects to fall to the Earth was seen in the ancient philosophies (which are 
typically represented by the Aristotelian philosophy) as a natural inclination to come down to the center 
of the Universe which is believed to be the Earth. Hence, this tendency was seen as an obvious and 
self-explanatory phenomenon with no need for an elaborate theory to justify. Also, there was no reason 
to suspect that this tendency has any connection with the tendency of heavenly bodies, such as Mars, 
to follow certain routes or orbits in their perpetual voyage. With the emergence of the Copernican 
astronomical model there were some contemplations about possible existence of a “physical force” that 
pulls down massive objects towards the center of the Earth and may even be responsible for the observed 
trajectories of celestial bodies. For example, Galileo investigated the acceleration of massive objects under 
the influence of the “gravity” of the Earth’! where he observed the independence of the gravitational 


7] The purpose of “certain extent” is to exclude tidal forces which are not neutralized by free fall. 
[8] Galileo was one of the first scholars (but not the only scholar) in the era of Renaissance to investigate gravity in a systematic 
way using modern (but rather crude) experimental and observational methods instead of relying on philosophical reasoning 


1.2 Historical Issues and Credits 12 


acceleration from the mass of objects and this observation can be seen as the root of the modern principle 
of equivalence which is one of the pillars of general relativity (see § 1.8.2). 

However, the modern scientific notion of gravity and its quantitative formulation did not emerge until 
the formulation of a mathematically rigorous astronomical model represented by Kepler’s laws which are 
based on the elaborate and systematic observations of Tycho Brahe. These laws paved the way to Isaac 
Newton to formulate the first modern and scientifically sound theory of gravity (see § 1.5) as represented 
by his universal law of gravitation which provides qualitative and quantitative explanation not only to the 
fall of massive objects towards the Earth and the motion of celestial bodies along their trajectories but to 
many other phenomena that characterize the physics of the Universe on large scale such as the formation 
of stars and galaxies and the interaction between them.!®! 

Although the triumph of the Newtonian gravity theory (as well as its applications such as celestial 
mechanics) was stunning and hence it was seen by the majority of physicists over more than two centuries 
as the ultimate theory, there was continuous emergence of problems in the application of this theory to 
astronomical observations. In fact, these problems can be classified into two main categories: problems 
based on the limitation of the observation and hence they do not affect the theory itself such as the pertur- 
bation of some celestial trajectories from their theoretical predictions due to the presence of unobserved 
astronomical objects, and problems that seem to threaten the theoretical model itself due to the failure 
of the theory to provide convincing explanations such as the perihelion precession of Mercury. In fact, 
the first category endorsed, rather than damaged, the Newtonian theory by providing evidence for its 
predictive power such as the discovery of Neptune as a result of discrepancies observed in the orbit of 
other planets with the application of the classical theory of gravity. However, the second category cast 
a shadow on the integrity of the theoretical model itself although the majority of physicists seemed to 
believe that these failures of the Newtonian theory do not arise from the theory itself which is essentially 
sound and reliable but from some other factors and details, such as limitations in the observational and 
computational techniques, or from minor theoretical limitations and hence there were attempts to find 
solutions to these problems within the classical theory rather than dismissing this theory altogether and 
looking for a different theory. A prominent example of these attempts is the retarded gravitational po- 
tential theory!!°! whose main objective was to fix the problem of Mercury precession that was a major 
failure to the Newtonian gravity (see § 8.1). However, apart from the apparently undisputed fact that this 
theory produced the same precession formula as the one based on general relativity we have no sufficient 
historical records about this theory to reach a conclusion or judgment about its essence, validity and 
merit. 

There are also tentative historical records about contemplations and theories of gravity in the 19*” 
century that were based on the ether assumption or on hypothesized velocity dependent potentials, as 
well as contemplations and proposals about the finity of the speed of propagation of gravitational effects 
which should be based on assuming a temporal dependency that is missing in the Newtonian gravitation 
theory. There were also attempts to model gravity on the style of electromagnetism (represented by 
Maxwell’s equations) as a full field theory with temporal and spatial dependency. Although we cannot 
reach a decisive conclusion about these contemplations and theories and their contents and significance, 
we can deduce that many of the basic elements and ideas on which general relativity rests were existent 
even before the formal and complete development of this theory. In this regard, we should also refer to the 
history of the development of the equivalence principle, which is a fundamental pillar of general relativity, 
as one of the precursors of this theory (see § 1.8.2). 

Following the development of Lorentz mechanics and Einstein involvement in elucidating the theory 
of special relativity, which is restricted to inertial frames, he started an effort to generalize this theory 


which was the standard method of investigation in the middle ages. 

(91 In fact, we should also point out to the essential contributions of several predecessors and contemporaries of Newton (e.g. 
Erigena, Adelard of Bath, Galileo, Bullialdus, Huygens and Hooke) to the development and formulation of the classical 
gravity law especially the inverse square dependency on distance and its relation to Kepler’s laws. 

19] This theory is based on a modified classical approach and is commonly associated with the name of Gerber (who was 

a German school master) although Gerber may not be the originator of this theory or the main contributor to its 
development. 
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to include non-inertial frames.!"!] This in essence is a generalization of the invariance with respect to 
velocity to the invariance with respect to acceleration where inertial frames (of constant velocity) are 
considered as a special type of accelerating frames (of variable velocity). On contemplating the similarity 
(which became equivalence later on) between acceleration and gravitation the attempted generalization 
became an effort to find a theory of gravity where gravitational frames will be considered equivalent to 
accelerating frames to which the fundamental principles of Lorentz mechanics should apply in some sense. 
In other words, this generalization will ensure the invariance of physical laws across all frames of reference 
(i.e. not only across inertial frames as in Lorentz mechanics) where the rules of Lorentz mechanics should 
apply in a potentially restricted sense (i.e. locally but not globally as in Lorentz mechanics). 

Now, Lorentz mechanics as a theory of space and time and their transformation across frames is es- 
sentially a geometric theory!!?! and hence in this context it seemed natural for any invariant theory of 
physics to consider the geometry of space where this space was already generalized in Lorentz mechanics 
to include time, i.e. Minkowski spacetime. So, following several attempts in different directions it was 
concluded (rightly or wrongly) that the answer to the quest of an invariant general theory of physics lies 
in the investigation of the geometry of spacetime and its invariant properties. Accordingly, the math- 
ematical tools of differential geometry (whose subject is the analytical investigation of the geometrical 
properties of multi-dimensional spaces in general) and tensor calculus (whose subject is the formulation of 
invariant mathematical concepts and relations through the use of tensors) were seen as the ideal choice for 
developing such a general invariant theory of physics that should include inertial frames and accelerating 
frames as well as their equivalent gravitational frames. The persistent effort of Einstein and other scholars 
who worked closely with him was finally culminated by the formulation of the Field Equation of general 
relativity in 1915-1916 which marks the birth of this theory. 

Following the controversial endorsement of general relativity by the 1919 solar eclipse expedition of 
Eddington and his team, there was strong interest in the first few decades in finding applications to the 
theory in fields like cosmology and astrophysics. For example, there were many suggestions of general- 
relativity-based cosmological models for the evolution and destiny of the Universe. This may have been 
enhanced by the “discovery” of the expansion of the Universe, according to the Hubble’s law, and the 
emergence of the Big Bang theory. There were also proposals of general-relativity-based astrophysical 
theories related for example to compact objects like neutron stars and black holes. However, most of 
these investigations and proposals were theoretical contemplations and mathematical models and hence 
general relativity was largely seen as a mathematically aesthetic but not very practical theory. In fact, in 
the 1940s and 1950s general relativity seemed to be in its way to die out and become extinct. 

This trend was reversed in the late 1950s and in 1960s where general relativity restored its momentum 
and presence in the mainstream science. This, in large part, is due to the promotional effort by pro- 
relativity physicists like John Wheeler who revived the interest in this theory and popularized it in the 
scientific community and even among the wider general public. An auxiliary reason for this stimulated 
revival is the technological revolution in the computational methods and technology and the advancements 
in astronomical equipment and techniques. The availability of high computational power associated with 
very advanced and efficient numerical methods facilitated the solution of the Field Equation of general 
relativity in many cases where analytical solutions do not exist. Also, the advancement in the technology 
and methods of astronomical observation, including things like space missions and artificial satellites and 
the use of radio astronomy, enabled physicists and astronomers to search for applications and verifications 
to the theory in several fields of interest. 

Nowadays general relativity is the dominant theory in many fields of research especially in cosmology 
and astrophysics. However, general relativity did not get this privileged status by natural selection and 
fair competition but mostly by promotion and indoctrination where general relativity was hugely favored 


21) Th fact, generalizing special relativity (or Lorentz mechanics) was a natural objective at that stage in the development 
of science and hence Einstein was not alone in this effort. So, specifying Einstein is because we are investigating general 
relativity which Einstein is commonly given the main credit for its development. Otherwise, other physicists were also 
working on this project (as well as developing new gravitation theories) at that time although their attempts and theories 
are generally ignored in the modern literature. 

[12] This is particularly true when we consider the Minkowskian merge of space and time into spacetime. 
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while alternative theories were not allowed to survive and thrive. In general, the mainstream scientists 
(including academic, research and funding bodies) treat alternative theories in principle as inferior to 
general relativity and even as pseudo science. However, we should distinguish between the alternative 
theories which are totally independent of general relativity and those which are based on or derived from 
general relativity and hence in essence they are no more than alterations and variant forms of general 
relativity and so they generally rest on the same physical and epistemological principles and follow very 
similar or identical theoretical methods. The latter generally survive and thrive under the umbrella of 
general relativity and hence they usually enjoy the favoritism that is enjoyed by general relativity. In fact, 
there are many theories of the latter type although most of them do not deserve particular attention due 
to their lack of originality and independence. Anyway, this is not the subject of investigation of this book 
and hence we refer the interested reader to the relevant literature. 

Finally, let briefly discuss the credit for the development of general relativity which is generally believed 
to be the creation of Einstein alone (or at least the contribution of others to this theory is regarded 
insignificant compared to the contribution of Einstein). According to the available historical records, this 
view is biased in favor of Einstein because although Einstein was the main contributor to the pursuit of 
this theory it was impossible for him to develop this theory single handed due to the limitation on his 
mathematical knowledge and skills.!'#! In fact, there are other major contributors to the creation and 
development of this theory such as Minkowski, Grossmann, Levi-Civita and Hilbert. The contribution of 
these scholars has particular importance and significance with regard to the development of the formalism 
of this theory (as opposite to the qualitative and epistemological sides) which is the really scientific part 
of the theory. 

Apart from acquiring fundamental ideas and techniques (e.g. the 4D spacetime of Minkowski and 
his energy tensor), Einstein got major guidance and advice on essential mathematical and theoretical 
aspects from the aforementioned scholars (and possibly from others). Moreover, he got substantial help 
in correcting his numerous blunders (e.g. by Levi-Civita and Hilbert) and reorienting his search into the 
right direction. In fact, without the essential contributions of these scholars general relativity could not 
have reached its final conclusion. These contributions include even the formulation of the final version of 
the Field Equation as will be discussed later. In brief, it was impossible for Einstein to develop general 
relativity single handed especially in its formal and mathematical part which is the really scientific part of 
the theory. The reality is that Einstein was mainly skillful in digging, finding and developing qualitative 
ideas of philosophical and epistemological nature but he did not have the essential mathematical knowledge 
and skills to develop a highly complex mathematical theory like general relativity. In fact, this should be 
evident from the many basic formal and logical mistakes and contradictions that can be found in his work 
(e.g. in his special relativity paper of 1905) especially in that early period of his research career. 

It is also important in this context to note that general relativity (in its version which is commonly 
attributed to Einstein) is not the first metric theory of gravity and hence Einstein (and possibly some of 
his collaborators) is not the first to come with such an idea and formulate such a theory that links the 
metric to the energy-momentum tensor. In this regard, major credit should be attributed to Nordstrom 
for his gravitation metric theory of 1913 (regardless of its experimental validity or invalidity and the 
alleged contradiction with observation). This theory (like its successor general relativity) also correlates 
the metric of spacetime g,,, to the energy-momentum tensor T),,. In fact, Nordstrom was not the only 
one to search for and propose such a metric theory prior to the development of general relativity, as 
a quick inspection of the history of research in that era should reveal. Moreover, precursors of general 
relativity (in the form of generic gravitation metric theories) can be traced back even to earlier times where 
historical indications suggest such speculations and initial formulations in the work of earlier physicists 
and mathematicians such as Minkowski and Laue. The unfortunate fact is that the history of science of 
that era (especially with regard to this part of research) is severely distorted and falsified by the followers 
of Einstein and the enthusiastic propaganda that followed the alleged endorsement of general relativity in 


[13] In this regard, we refer to the quote by Minkowski (who was a mathematics teacher of Einstein): “The mathematical 
education of the young physicist [Albert Einstein] was not very solid, which I am in a good position to evaluate since 
he obtained it from me in Zurich some time ago”. We should also refer to the Einstein admission of his mathematical 
limitation and his struggle with mathematics during his school days. 
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1919 by Eddington and his team. 

Accordingly, we think substantial credit for developing this theory should go to the aforementioned 
scholars (as well as others) without whom “general relativity” will be no more than a collection of philo- 
sophical contemplations, qualitative physical ideas and basic (and potentially inconsistent) mathematical 
formulation that do not rank to the state of a scientific theory. It is a huge bias to consider the contri- 
butions of these scholars as something like consulting a textbook on a mathematical problem or seeking 
the advice of a mathematics or science teacher to solve a casual difficulty (as it is usually depicted in the 
literature). Their efforts and contributions in developing the theory are not less important or significant 
than the effort and contribution of Einstein, and this should be considered when re-writing the history 
of general relativity which is not less distorted than the history of special relativity (see our book “The 
Mechanics of Lorentz Transformations”). Regarding the dispute about the priority of developing the final 
form of the Field Equation and whether the credit should go to Hilbert or Einstein, we question the 
priority of Einstein in developing this equation and hence we think the credit (or at least the main credit) 
belongs to Hilbert (see exercise 6).!"41 

We should finally note that general relativity in its current state and extent is the product of the 
persistent effort of several generations of scientists around the world over more than a century. Hence, the 
theory in its current state is far from being “the theory of Einstein” (as commonly depicted in the literature) 
even if we accept that Einstein was the major contributor to the original proposition and formulation of 
the theory. The reality is that large parts of the formulation, interpretation, expansion, generalization, 
correction and refinement of the current theory belong to other physicists and mathematicians. However, 
despite this there is a general tendency in the literature to attribute the credit for the entire theory 
to Einstein, which is very far from reality. This, in fact, is part of the general trend in the literature 
to magnify and exaggerate the role of Einstein whenever and wherever this seems possible, which is a 
practice inconsistent with the ethics and spirit of science.!!*1 


Problems 


1. State Kepler’s laws. 
Answer: They are: 
(a) Planets follow elliptical orbits in their revolution around the Sun with the Sun being at one focus. 
(b) The line joining the Sun to the planet sweeps out equal areas in equal periods of time. 
(c) The cubes of the mean distances between the Sun and the planets are proportional to the squares 
of their orbital periods. This law may also be stated as: for any two planets, the ratio of the cubes of 
the semi-major axes of their orbits is equal to the ratio of the squares of their orbital periods. 

2. State Newton’s law of gravity. 
Answer: Two bodies attract each other by a force that is directly proportional to the product of their 
masses and inversely proportional to the square of the distance between their centers. 


Exercises 


1. State some limitations of the classical gravitation theory that motivated the development of general 
relativity. 

2. Discuss the twists and turns in the development of general relativity. 

3. Discuss the failure of Lorentz mechanics to include gravity under the invariance requirement (to be 
like mechanics and electromagnetism). 

4. Outline the main limitations of Lorentz mechanics (or special relativity) with regard to the require- 
ment of the invariance of physical laws and explain and assess the general relativistic methodology in 
addressing these limitations. 


[14] Due to the dispute about the contribution of Hilbert and Einstein in developing the final form of the Field Equation of 
general relativity, as well as the contributions of other scholars to the development of this equation, we prefer to call this 
equation “the Field Equation”, i.e. with uppercase initials and without the “Einstein” or “Hilbert” attachments which are 
used in the literature. 

[5] Tn fact, this practice of exaggeration and bias is not restricted to Einstein and general relativity (for example we have a 
similar situation with Newton and his theories) although the situation is more grave in the case of Einstein and general 
relativity. 
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5. Briefly outline the role of Einstein in the development of general relativity and in the development of 
a general theory of space and time (as well as gravitation metric theory). 

6. Why we question the priority of Einstein in developing the final form of the Field Equations of general 
relativity and attribute the main credit to Hilbert? 

7. Is general relativity the first metric theory of gravity? 


1.3. General Terminology 


In this section we provide brief definitions of essential concepts and terms that are frequently used in this 
book. 

e Massive or material object is a physical object that possesses finite mass (i.e. m > 0) such as a 
particle of sand, while massless object is a physical object with no mass (ie. m = 0) such as a photon. 
However, we should note that massless object does not have “rest” mass but it should have effective (or 
equivalent) mass according to the mass-energy relation, i.e. m = E/c? where m is the effective mass of 
the object, FE is its energy and c is the characteristic speed of light. So, massive object has rest mass while 
massless object has no rest mass although it has effective mass. 

e Spacetime is a combination of space and time in which objects do exist and physical events take 
place. In general relativity, spacetime is not seen as space plus time but as a single manifold in which 
temporal and spatial dimensions are treated equally as dimensions of a 4D manifold. The concept and 
formal treatment of spacetime in general relativity is as in special relativity apart from the fact that the 
spacetime of special relativity is flat (i.e. Minkowskian) while the spacetime of general relativity is curved 
(i.e. Riemannian). 

e Event is a physical occurrence in the spacetime manifold and hence it is represented by a single point 
in this manifold. Accordingly, an event is identified by a set of 4 spacetime coordinates: 1 temporal and 3 
spatial, e.g. (9,1, x?, x?) where «° represents the temporal coordinate ct (with c being the characteristic 
speed of light in free space and t is time) while z', x’, «°° represent the spatial coordinates. We may call 
(x°, 21, 2?,2%) the 4-position in accord with the upcoming terminology (e.g. 4-velocity). 

e Riemannian space is a manifold characterized by the existence of a symmetric rank-2 tensor called 
the metric tensor that describes the geometry of the space (see § 2.5). More accurately, Riemannian space 
is characterized by having an invariant line element ds of the following form: (ds)* = gi;dx'da, i.e. (ds)? 
is quadratic in the coordinate differentials as well as being dependent on a symmetric rank-2 metric tensor 
gi; which is generally a function of coordinates.!"4 

e Pseudo-Riemannian space is like Riemannian space in having a symmetric rank-2 metric tensor but, 
unlike Riemannian space, its metric (or rather the quadratic form of its line element) is not positive definite. 
This distinction is a mathematical convention and hence physicists generally do not follow this distinction 
because it is irrelevant to their purpose. Accordingly, the spacetime of general relativity is commonly 
described as Riemannian (in disregard to this convention) although it is actually pseudo-Riemannian. In 
fact, this is a matter of terminology more than an abolishment of the difference between Riemannian and 
pseudo-Riemannian which is a real difference (as we will see later). 

e In Riemannian space, the differential line element ds is defined by (ds)* = gi;dx'dxi where gj; is the 
metric tensor, dx’ and dx/ are coordinate differentials and i and j range over the space dimensions (noting 
that we are using the summation convention, as will be declared later). In this book, the line element of 
the spacetime of general relativity is symbolized with do to make it specific to general relativity.!!7] The 
square of the line element |i.e. (ds)” and (do)’] is called the quadratic form. 

e In Riemannian space, the space interval is the length of a general path (whether straight or curved) 
of a finite size connecting two points in the space and hence it is the integral of the line element ds (as 
defined in the previous point) along the path, that is: 


s= | as (1) 


16] As we will see, other conditions (e.g. being invertible) are also imposed on the metric tensor of Riemannian space. 
117] In fact, this also applies to the line element of the spacetime of special relativity and hence it is specific to the relativity 
theories. 
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where s is the space interval and the integral is evaluated over the path P which connects the two points. 
This definition applies to the spacetime of general relativity!!*| (which is a pseudo-Riemannian space) and 
hence the spacetime interval o is defined as the integral o = { do. However, we may stretch the term 
“spacetime interval’ to include even its infinitesimal form do (which is the line element). 

e A space is flat if it is possible to find a coordinate system for the space with a diagonal metric tensor 
whose all diagonal elements are +1; otherwise the space is curved. More formally, an nD space is flat iff 
it is possible to find a coordinate system in which the quadratic form (ds)? is given by: 


n 


(ds)? = $7 Gi(da")? (2) 


i=1 


where the indexed ¢ are +1 while the indexed zx are the coordinates of the space. 

e Minkowski space, or Minkowski spacetime, is the 4D spacetime manifold that underlies special rel- 
ativity and hence it is flat and void of matter and energy. In fact, it is void of matter and energy as a 
source of gravity (i.e. in considerable quantities that distort the metric properties of the spacetime) but 
it could (and usually should) contain some matter and energy. 

e 4-vector means a vector (in its technical tensorial sense) in the 4D spacetime. Similarly, 3-vector 
means a vector in the 3D ordinary “spatial” space. Terms like 3-tensor and 4-tensor may also be used 
to refer to tensors in 3D space and 4D spacetime. This also applies to terms like 3-operator, 4-operator, 
3-quantity and 4-quantity where quantity is a given physical quantity (e.g. 3-velocity and 4-momentum). 
It should be noted that the “4-” label generally means invariance under the presumed transformations in 
the spacetime (i.e. Lorentz transformations in the Minkowskian spacetime of special relativity and general 
transformations in the Riemannian spacetime of general relativity although in the latter case the label 
usually refers to the local Lorentzian origin of the quantity). 

e Geodesic curve is the most “straight” or most “direct” route connecting two points in a given space. 
Geodesic curves are normally, but not necessarily, the curves of shortest length. 

e Null geodesic is the geodesic path that is followed by a light signal (or any massless object to be 
more general) in the spacetime (assuming the signal is not under non-gravitational influence). The line 
element do (and hence the spacetime interval 7) vanishes identically along null geodesic and that is why 
it is “null”. 

e Free particle (or object) is a particle that moves in the spacetime under no influence at all (as in 
the spacetime of special relativity) or under the influence of gravity alone (as in the spacetime of general 
relativity). In fact, if we consider the inertia as well then we may say: a particle is free (according to 
general relativity) if it is under the influence of only inertia or/and gravity (i.e. it is not under the influence 
of any force). This should be appreciated in the context of the principle of equivalence (see § 1.8.2) where 
inertia and gravity are unified. 

e Inertial mass is a quantitative measure of the tendency of massive objects to resist any attempt by a 
foreign agent to change their state of rest and motion (i.e. being accelerated). Accordingly, we need to 
apply a physical force to change the state of a massive object from rest to motion or from motion to rest 
or change its speed or/and direction of motion. The inertial mass is formally represented (and defined in 
a sense) by Newton’s second law of motion and hence it is associated with acceleration. 

e Gravitational mass is a quantitative measure of the pulling force (or attraction) that a massive object 
experiences in the presence of another massive object. Accordingly, in the presence of a given massive 
object A, object B which is made of certain amount of matter (i.e. it has a given gravitational mass) will 
experience a stronger attractive force than object C that is made of a smaller amount of matter (i.e. it 
has a smaller gravitational mass) assuming that the distance between A and B and the distance between 
A and C are equal. The gravitational mass is formally represented (and defined in a sense) by Newton’s 
law of gravitation and hence it is associated with gravitational field. 

e Tidal force in the context of gravity means a force that originates from a gradient in the gravitational 
field due to its non-uniformity and hence different parts of an object in such a field experience forces of 


[18] In fact, this also applies to the spacetime of special relativity (see exercise 5). Also, due to physical considerations the 
sensibility of this may be restricted to timelike intervals (or at least lightlike intervals are excluded because they are null). 
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different strength or/and direction and this results in physical effects that tend to distort the object such 
as stretching the object along a given orientation. In fact, we may define tidal force more generally and 
succinctly as differential force across a distance that spans one (or more) physical object. 

e Gravitating object or body means a source of gravitational field while gravitated object or body 
means a body affected by (or being under the influence of) the gravitational field of the gravitating object. 
We usually use the “gravitating object” term to refer to the larger (i.e. the more massive) of the two objects 
while we normally use the “gravitated object” term to refer to the smaller object. So, in the gravitational 
relation between the Sun and the Earth the Sun is the gravitating object while the Earth is the gravitated 
object. We may also refer to such two objects as gravitating objects or gravitating system. 

e We commonly use the term “other forces” to exclude gravity although gravity is not a force in a 
technical sense according to general relativity. 

e We may use free fall gravitation to mean motion in the absence of any force other than gravity. 
However, there are some details about this issue which will be discussed in the Exercises. 

e Test particle (or test object) is a massive object whose mass is so tiny that it does not have tangible 
effect on the gravitational field that influences the test particle. 

e Weak gravitational field may be defined as a field in which the gravitational potential energy of a 
test particle is negligible in comparison to its mass energy while strong gravitational field is a field in 
which the gravitational potential energy is comparable to the mass energy (also see exercise 19 of § 4.1.1). 
As we see, there is no sharp border between the two and hence what is weak and what is strong may 
depend on the case and context. It is generally accepted that Newtonian gravity is a valid approximate 
theory in weak fields but not in strong fields although this should depend on the case and context as well 
as other factors (as will be investigated later). 

e Local means a confined part of space (or spacetime) over which the effects of curvature, non-linearity, 
non-uniformity, non-homogeneity, non-isotropy, etc. are negligible. In this context, we should note that 
terms like “locally flat” (which frequently occur in the literature of relativity including this book) may 
be used in two meanings: as approximation (and hence it applies at any location even in curved spaces 
assuming they are Riemannian) and as strict condition and hence it applies only to the strictly flat parts 
of space (e.g. plane that is smoothly connected to a semi-sphere). The technical distinction between the 
two (within the context of the relativity theories) is that in the first case the metric of the spacetime region 
is given by guy ~ Nav + Ep (where 7, is the Minkowski metric tensor and ¢,,, is a perturbation tensor 
such that |e,,,| <1) while in the second case the metric of the spacetime region is given by guy = Nv. 


Problems 


1. What is the relation between Minkowskian space and Riemannian space? 
Answer: Minkowskian space is an instance (or special case) of Riemannian space (or rather pseudo- 
Riemannian space). In fact, Minkowskian space is a flat 4D pseudo-Riemannian space (or rather 
spacetime). 

2. Define inertial and gravitational mass in a few words. 
Answer: Inertial mass is a measure of the resistance to kinematical change (i.e. the state of rest and 
motion) while gravitational mass is a measure of the strength of gravitational attraction. 

3. Classify space intervals in pseudo-Riemannian space. 
Answer: In pseudo-Riemannian space, space intervals are classified (according to the sign of their 
quadratic form) as timelike when (ds)* > 0, lightlike when (ds)” = 0, and spacelike when (ds)” < 0. 
We note that the classification with regard to the sign of the quadratic form may be reversed |i.e. 
timelike when (ds)* < 0 and spacelike when (ds)? > 0] depending on the adopted convention about 


the sign pattern of the metric like +4 or — +++ (similar to the situation in special relativity 
where the Minkowski metric may be given as diag [+1, —1, —1, —1] or as diag [—1, +1, +1, +1]). In this 
book we follow the + ——-— sign pattern and hence (ds)” > 0 for timelike and (ds)* <0 for spacelike. 


Note: an issue related to the above classification is the classification of vectors where a vector (e.g. 
V) is classified (according to the sign of its square which is given by the inner product V-V) as 
timelike when V-V > 0, lightlike when V-V = 0, and spacelike when V-V < 0. For example, 
the velocity 4-vector U is timelike because we always have U-U = c? > 0 (see our book “The 
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Mechanics of Lorentz Transformations”). Also, a trajectory (or curve or world line) is described as 
timelike /lightlike/spacelike when its tangent vector is timelike/lightlike/spacelike. 


Exercises 


1. Make a brief comparison between the space of special relativity and the space of general relativity. 
2. Clarify the issue of line element and quadratic form in general nD space and in the spacetime of general 
relativity. 
What is the essence of the definition (which is given in the text) of flat space? 
State some of the distinctive properties of pseudo-Riemannian spaces. 
Give a detailed and formal definition of “spacetime interval” in special relativity. 
What “free particle’ means in special relativity and in general relativity? What is common to both? 
Also, link this to the concepts of inertia and gravity and hence find a more fundamental common 
factor. 
7. Give examples of massive free objects (i.e. under the influence of inertia or/and gravity alone) in the 
spacetime of general relativity. 
8. Is there a difference between “free” and “freely falling”? 
9. Compare inertial mass and gravitational mass. 
10. What is the relation between inertial mass and acceleration? 
11. Give brief definitions of the following terms: pseudo-Riemannian space, null geodesic, tidal force, free 
fall gravitation, and test particle. 
12. Give a mathematical condition that distinguishes between Riemannian space and pseudo-Riemannian 
space. 
13. Give some examples of the significance of the locality condition in various physical contexts. 
14. Give common examples of how curved spaces of various dimensionality look flat locally. Also justify 
your findings. 
15. Give mathematical definitions for “locally Euclidean space” and “locally flat spacetime”. Also explain 
“local Cartesian coordinate system” and “local inertial frame”. 
16. Use the answer of exercise 15 to give mathematical conditions for local flatness in Riemannian and 
pseudo-Riemannian spaces. 


PoP 


1.4 General Conventions, Notations and Remarks 


In the following points we provide some notes about the conventions and notations that we use in this 
book. We also include general remarks that we need for future investigation. 

e Due to the restrictions on the size of the book, some additional materials (like some non-essential proofs) 
are not included in this book. However, these additional materials are mostly given in our other books. 
So, for completeness and to satisfy the need of the readers who are interested in these details, we make 
frequent references to our books using the abbreviations B2, B3, B4, B2X, B3X, B4X where we linked 
these abbreviations to the books in the References of this book. 

e We use the summation convention in the tensor formulations and hence a twice-repeated index in a 
tensor term like i in A;A* means summing over the range of i. 

e The Latin indices generally range over 1,2,3 while the Greek indices generally range over 0,1, 2,3. The 
Latin indices usually represent the three spatial coordinates while the Greek indices represent the four 
spacetime coordinates (one temporal indexed with 0 and three spatial indexed with 1, 2,3). However, the 
Latin indices may also be used for general nD spaces of non-specific dimensionality (see for example § 2). 
e The mathematical quantities in this book are generally real, i.e. not imaginary or complex. Accordingly, 
all the arguments of real-valued functions that are not defined for negative quantities, like square roots 
and logarithmic functions, are assumed non-negative by taking the absolute value, if necessary, without 
using the absolute value symbol. This is to simplify the notation and avoid potential confusion with other 
notations. So, \/g means /|g| and In(g) means In(|g|). 

e We use “classical mechanics” in a more general sense than that we used in our book “The Mechanics of 
Lorentz Transformations” (i.e. B4) by including classical gravity as represented by Newton’s gravitation 
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theory. We also use “classical gravity” (or similar tags) to refer to the Newtonian gravity. Also, “classical” 
may be used occasionally as opposite to quantum mechanical (and hence even the relativity theories are 
classical in this sense). 

e Although we frequently express personal opinions, the book in general is based on the common views 
held by the mainstream scientists and presented in the mainstream literature of general relativity (or at 
least they are obtained from an objective analysis of the theory and its formalism). Therefore, the views 
and style of presentation in the book should not be interpreted as personal views. In fact, some of these 
contradict our views as expressed in the present book and in our previous books. 

e We provided a Nomenclature list (in the front of the book) which the reader is advised to consult when 
needed for interpreting the mathematical symbols and abbreviations. Therefore, we do not provide such 
explanations within the text systematically although we usually explain the symbols at the place of their 
first occurrence or when there is a potential confusion with other symbols. 

e Because there are many symbols in this book, due to its highly mathematical nature, and because we 
prefer to use the conventional symbols that are commonly used in the literature of this subject, some 
symbols are used in more than one meaning, e.g. G for the gravitational constant and for the trace 
of Einstein tensor. Therefore, to avoid confusion we generally explain the appropriate meaning of such 
symbols within the text although we might rely on the context for clarification. 

e We generally use “space” in its technical mathematical sense and hence the 4D spacetime is a space in 
this sense. However, “space” may also be used occasionally to mean the ordinary 3D “spatial” space. The 
meaning should be generally obvious from the context. 

e “Matter” is usually a label for massive objects. However, for brevity “matter” may include energy in 
some contexts where the meaning is obvious. This is justified by the equivalence between mass and energy 
and the fact that the source of gravity in general relativity includes energy and is not restricted to mass 
as it is the case in classical gravity. 

e We use “gravitational frame” or “gravitating frame” to refer to a frame of reference in which gravitational 
fields do exist, i.e. the spacetime in which the frame exists is not void of matter and energy and the frame is 
at a finite distance from the source of gravity. These tags may also be restricted to frames in gravitational 
fields but they are not in a state of gravitational free fall. 

e Unlike our book “The Mechanics of Lorentz Transformations” where we used “Lorentz mechanics” sys- 
tematically instead of “special relativity”, we commonly use special relativity in the present book to refer 
to this subject due to the familiarity of the readers with this term as well as the correspondence with 
general relativity. Moreover, we made our point in that book and there is no strong reason to do the same 
in this book. 

e Unlike our previous books where we used different notations for the coordinates of Cartesian and general 
coordinate systems, in this book we use contravariantly-indexed zx (e.g. x’) for both and the distinction (if 
needed) will rely on the context if no explicit statement is made. We may also distinguish the Cartesian 
coordinates by indexing them with lower indices (e.g. 2;) since in Cartesian systems the covariant and 
contravariant types are equivalent. However, in a few cases the lower index is used to distinguish the 
covariant form of general coordinates. 

e We use “the Field Equation” as singular or plural depending on the context and convenience where the 
singular refers to its symbolic or tensorial form while the plural refers to its components. This also applies 
to other tensorial equations like the geodesic equation. 

e Physical quantities (such as energy and momentum) in this book are generally relativistic (or what we 
call Lorentzian) unless it is stated or indicated otherwise. However, the context should be considered to 
determine the ultimate meaning. 

e “Radiation” in general relativity and cosmology is commonly used to refer to massless particles and 
hence it is not restricted to electromagnetic radiation. We may use this without clarification relying on 
this convention as well as context. 

e For simplicity and convenience, in the case of rank-2 tensors we commonly use tensor and matrix (which 
represents the tensor) interchangeably. However, the matrix symbol is usually distinguished by square 
brackets, e.g. the tensor g;; is represented by the matrix [gi,]. 

e We generally follow the common convention of using the indicial notation of tensors to symbolize and 
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represent the components of tensors as well as the tensors themselves. For example, A* can represent the 
contravariant type of a vector A as well as its i‘” component. The meaning should be obvious from the 
context if no explicit explanation is given (e.g. the vector A’ or the component A’). The rules of index 
pattern may also be used as a good intuitive indicator (although not always definite) to the nature of 
expressions, equations and relations, e.g. A’ ik = Bi. should suggest a tensor equation while Ajj = Bj; 
or Al = Bi must be a component relation between different tensors. 

e Regarding the mathematics of tensors, we generally follow the old school approach where tensors and 
their main characteristics are generally distinguished by the type and pattern of indices and hence we 
talk for example about covariant and contravariant basis vectors and tensors avoiding the mathematics 
and terminology of differential forms and dual vectors. This is to avoid unnecessary complications and 
needless overheads. Moreover, in our view the old approach is more intuitive and physical in spirit than 
the mathematically-oriented modern approach. 

e The common convention in the literature of relativity theories is to write the quadratic forms as ds? 
and do”. However, to avoid confusion we do not follow this convention. Instead, we write them as (ds)? 
and (do). 

e The term “acceleration” (and its alike like accelerating) may be used in its general sense to mean a change 
of the state of a physical object from rest to motion or from motion to rest or change of its speed or/and 
direction of motion. It may also be used to mean temporal rate of increase of velocity in translational 
motion specifically and hence it excludes for example deceleration and rotation. To avoid complications 
in the phrasing and presentation, we commonly rely on the context to determine the intended meaning. 
e We use “relativity theories” in this book to mean the theories of special and general relativity specifically. 
e The formulae, physical constants and calculations are generally given and performed in standard SI 
units (Systéme Internationale d’Unités). 


1.5 Classical Gravity 


The classical concept and formalism of gravity are embedded in the Newton’s law of gravitation which 
states that any two massive objects attract each other by a force that is directly proportional to the 
product of their masses and inversely proportional to the square distance between them. Formally, this 


law is expressed as: 
Gmyme 


jm (3) 


where f is the magnitude of the attractive force, G (~ 6.674 x 107!! Nm?/kg”) is the gravitational 
constant,!!91 m4 and mz are the masses of the two objects and d is the distance between them. We note 
that “attract each other” indicates that the direction of the force is along the line joining the two objects. 

The above gravitation law represents classical gravity in its simplest scalar form assuming, for instance, 
that the masses are point-like (or particles) with no spatial extension. For more general and technical 
form of classical gravity, where gravity is formulated as a field theory with the use of the concepts and 
techniques of calculus, we re-cast the above equation in a vector form where the gravity is assumed to 
be between a spherically-shaped gravitating body of mass M of spherical symmetric distribution and a 
gravitated test particle of mass m with the use of a normalized spherical coordinate system whose origin 


is at the center of M, that is: 
GmM 


f= 3 Cr (4) 


r 


where f is the gravitating force (i.e. the force that M exerts on m),!2° r is the radial distance of m from 
the origin of coordinates (which is at the center of MM) and e, is a unit radial vector. The minus sign is 
because the force of gravity is attractive and hence it points towards the origin. 


119] Tt is noteworthy that G (according to some theories) may vary in space and time and hence it is a function of spacetime 
coordinates although such variations (if they really exist) should be negligible for most common purposes and hence it 
is essentially constant. 

[20] For simplicity, we use M and m to refer to the gravitating and gravitated objects (i.e. not only to their masses). 
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Next, we define the “gravitational force field” g = g(r) (with r being the position vector) as the 
gravitating force per unit mass of the test particle (and hence g is a vector field), that is: 


£= => ee (5) 


f 
m 
We then define the “flux” of the gravitational field as its area integral over a spherical surface S that 
encloses the entire M, that is: 


Flux = ‘c g-ndA (6) 
Ss 


where n is a unit vector that is normal to the surface and pointing outward and dA is an infinitesimal 
area element. Now, if we choose this surface to be the surface of M/, which is a sphere of radius R, then 


we have: 
GM 
[ie-naa [ (-Sre-] ‘er, dA (7) 


p=20 
= iz ft SER? sind dd 
a g= 


p=20 
[- “ih —GM sin6 d0 do 
0=0 J¢=0 


= 
— | —27GM sin 6 dé 
0 


I 

<a 
|S 
—E 
~ 


= -—-47rGM 


where in line 1 we use Eq. 5 (with r = R), line 2 is because e,. is a unit vector, in line 3 the generic area 
integral over the sphere of radius R is converted to its explicit form in terms of spherical coordinates, and 
in lines 5 and 6 we perform and evaluate the integrals over ¢ and 6 respectively. It should be obvious that 
M inside the integral is independent of 6 and ¢. 

Now, by the divergence theorem we have: 


[e-nda= fv eav (8) 


where 2 is the region of space occupied by the gravitating object and dV is an infinitesimal volume 
element. On comparing the last two equations we get: 


| Vipav ==meM (9) 
a 


To eliminate the integral (which can be annoying in manipulation and calculation) we express M as an 
integral and hence we have: 


[ v-8av =-a0c [ pdV (10) 
Q Q 


where p (which is a function of r alone) is the mass density of the gravitating object. Now, since this 
equation is valid regardless of the form of dV and the system that we use, then we can conclude: 


V-g=-—41Gp (11) 


The last equation, which states that the divergence of the gravitational field is proportional to the mass 
density of the gravitating body, is valid in general although it is derived under certain restricting conditions 
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and assumptions, e.g. spherical shape of gravitating body and spherical symmetry of its mass distribution. 
This also applies to the following equations that are derived from this equation.|!?#] 

Now, although Eq. 11 is essentially a scalar equation (since the divergence of vector is scalar) it still 
involves vector quantities and operations. It is advantageous in applications to have a purely scalar form of 
this equation and hence we will do that to Eq. 11. We note first that the gravitational field is conservative 
and hence it can be expressed as a gradient of a potential field ® (which is the gravitational potential 
energy per unit mass), that is: 


g=-Vv® (12) 

Accordingly, Eq. 11 becomes: 
V-(-V®) = —4nGp (13) 
V’h = 4nGp (14) 


where in line 2 we use the definition of the Laplacian operator V? as the divergence of gradient. The last 
equation, which is the Poisson equation for gravity, is the scalar form (or rather purely scalar form) of 
the Newtonian gravity as a field theory. 


Problems 
1. Find the magnitude of the Newtonian gravitational force between two point masses with m, = 2.5 kg 


and mz = 10° kg which are separated by a distance d = 10° km. 
Answer: We have: 
_ Gmyme 6.674 x 10714 x 2.5 x 10° 


me, a —21 
fa 5 _ ~ 1.669 x 10-74N 


2. What “conservative field” means? 
Answer: It is a field whose line integral is independent of the path, i.e. the value of its line integral 
depends only on the value of the field at the initial and final points of the path. 


Exercises 


1. Find the gravitational force field on the surface of the Earth (assuming it is a perfect sphere with 
spherically symmetric mass distribution). 

Obtain a mathematical expression for the gravitational potential ®. 
Give two properties of conservative vector field. 

Prove that the gravitational field is conservative. 

Verify the divergence theorem for the gravitational field of a gravitating body assuming it is a perfect 
sphere with spherically symmetric mass distribution. 


[22] 


OP WN 


1.5.1 Planetary Motion 


Planetary motion in classical physics is part of the celestial mechanics which is based on Newton’s laws of 
motion and Newton’s law of gravity. However, from a historical perspective planetary motion is described 
by Kepler’s laws which can be loosely stated as follows: 

(a) The orbits of planets have elliptical shape with the Sun being at one focus. 

(b) The areal velocity of planets is constant.|?3l 

(c) The squares of the orbital periods of planets are proportional to the cubes of the distances of the 
planets from the Sun. 

It can be shown (see Problems) that these laws are direct consequences of Newton’s laws (supported by 
certain definitions, conventions and assumptions as well as simple mathematical and physical facts). 


[21] The generality may be inferred from the linearity (and hence superposition) plus the fact that any gravitating body can 
be considered as an assembly of particles each of which obeys the above equation (and hence the following equations). 

[22] Tn questions like this we assume a single gravitating object with spherically symmetric (or point like) mass distribution. 

[23] The areal (or sectorial) velocity (or speed) is the temporal rate of change of the area swept by a line joining an object 
to a given point in space. In planetary motion, the object is the planet while the given point is the center of the Sun. 
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In fact, planetary motion in classical mechanics is described by a system of two equations which are 
derived from Newton’s laws (see Problems).?4! These equations are: 


deu GM 
ee ar (15) 
dp 2 


where u = 1/r with r being the radial distance in plane polar coordinates, ¢ is the polar angle in these 
coordinates, G is the gravitational constant, M is the mass of the gravitating object (i.e. the Sun in 
our case assumed to be at the origin of coordinates), B is a constant (which represents the magnitude of 
angular momentum per unit mass or twice the areal speed), and ¢ is the time. 
Using the above system of equations, it can be shown (see Problems) that the shape of the planetary 
orbits is described by the following equation: 
e/A 


sp 1 
. ecos¢+1 (17) 


where A is a positive constant (with physical dimensions of reciprocal length) and e = a This is a 
standard equation of an ellipse (with eccentricity e) in polar coordinates; a fact that is inline with the 


statement of Kepler’s first law (see Problems for details).|?5] 
Problems 


1. Use Newton’s law of gravity (supported by Newton’s second law) to develop a system of equations 
that describe planetary motion (and hence show that it is the system given by Eqs. 15-16). 
Answer: We use a plane polar coordinate system (r,¢) where the Sun (i.e. the gravitating body to 
be more general) is at the origin of the coordinate system, r is the radial distance from the Sun to the 
planet (i.e. gravitated body to be more general), and ¢ is the polar angle (with ¢ = 0 corresponding 
to the perihelion position).!?6| Accordingly, the radial position r (noting that we are dealing with 
radial force), the velocity v and the acceleration a of the planet are given by (refer to mathematical 
textbooks): 


v= es Te, + roeg 
a=ve= (i - ré°) e, + (rd + 26) eg (18) 


where the radial distance r and the unit vector e, are functions of ¢ (and hence functions of time due 
to revolution) and the overdot symbolizes derivative with respect to time t (i.e. d/dt).?71 Now, the 
gravitational force is given by Eq. 4, that is: 


GmM 
f = - a er 
r 
GmM 
ma = — e, 
r2 
GM 
a= eT rs (19) 


[24] Bq. 15 can also be derived from the conservation principles of energy and angular momentum. 

[25] We are considering the case of ellipse based on physical considerations related to the limits on the eccentricity of the 
planetary orbits; otherwise the above formulation is more general. 

[26] This coordinate system can be seen as the 2D section of a 3D spherical coordinate system in the plane 6 = m/2 (refer to 
§ 1.5 and § 5.2). In fact, we are assuming that the planetary orbits are planar; a fact that is justified classically because 
gravity is a central force field (which is also inline with the conservation of angular momentum in the absence of torques). 

[27l In fact, we can write: r = r(@(t)) and similarly e, = e; (¢ (t)). 
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where m and M stand for the mass of planet and Sun respectively and where in line 2 we use Newton’s 
second law of motion. On comparing Eq. 18 with Eq. 19 we obtain: 


es : GM 
fore? = - 72 (20) 
ro+2r¢ = 0 (21) 
Now, if we multiply Eq. 21 by r we get: 
ro + arid = 0 
d/ >; 
aco 


where in line 2 we use the product rule of differentiation. So, if we integrate the last equation with 
respect to t we get: 


r¢=B (22) 
where B is a constant (with physical dimensions of angular momentum per unit mass or areal speed). [28] 
Accordingly, the system of Eqs. 20 and 21 becomes: 
. GM 
a 2 
f-r¢ = - 2 (23) 
rd = B (24) 


Now, if we introduce a new dependent variable u = 1/r (and hence r = 1/u) then we have: 


dr 1 du 1dudg du ie ptt 
di wd wddd de ° dé 


where in step 1 we use the power and chain rules (or composite rule), in step 2 we use the chain rule, 
step 3 is just a notational modification, and in step 4 we use Eq. 24. Similarly: 


d?r = d (F) _ d ( Bs) = Bf (F) 
dt? dt \ dt dt dp dt \d@ 
a B d (3) do 
dt de \db) di 
d?r = plu do 
dt? dd? dt 
er du B 
de. age ge 
- 5 peu 
r = —-B Ie (25) 


where in line 2 we use the chain rule, in line 4 we use Eq. 24, and in line 5 we use the dot notation 
and u = 1/r. On substituting from Eq. 25 into Eq. 23 (and using u = 1/r) we get: 


d?u 1. 
2: 2, 2 —s 2 
d? 1 
— Bh og — Be = —GMw 
U 
d?u _ GM 
age = pee 


[28] With a proper choice of coordinates, ¢ is positive (as well as r?) and hence B is positive. 
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where in line 2 we use Eq. 24, and in line 3 we divide by —B?u?. Accordingly, the system of Eqs. 23 
and 24 becomes: 


au GM 
do 2 
aide B 27 
7 u (27) 


which is the same as the system of Eqs. 15-16. The last two equations represent the classical system 
of equations that describe the orbital motion of planets (and indeed the classical orbital motion in 
general) and hence the classical solution of the problems of planetary motion requires solving this 
system of equations (refer to the next problems and see § 5.2). 

2. From the answer to problem 1, show that Kepler’s first law is a consequence of Newton’s laws. 
Answer: To show that Kepler’s first law is a consequence of Newton’s laws it is sufficient to show 
that Kepler’s first law can be derived from the system of equations that we developed in problem 1 
(i.e. Eq. 26 to be more specific) since this system is obtained from Newton’s laws. The solution of Eq. 
26 is:!?9 

w= Acoso + 22 (28) 
(with A being a positive constant) as can be easily verified by substituting from the last equation into 
Eq. 26. From the last equation we get: 


1 GM 
Ais, Sof aay 
= cos @ + B? 
: (29) 
ro o= —W—; 
Acos@+ ch 
B? 
GM 
r= pel (30) 
Gm coset 1 
A 
pilot (31) 
ecos¢+ 1 
where in line 1 we use u = 1/r, in line 2 we take the reciprocal of both sides, in line 3 we multiply 
the numerator and denominator by pie and in line 4 we symbolize ae with e. The last equation is 


a standard form of an ellipse with eccentricity e in polar coordinates where the origin of coordinates 
is located at one focus (refer to mathematical textbooks). This shows that the planetary orbit has 
elliptical shape with the Sun being at one focus (as stated by Kepler’s first law) and hence Kepler’s 
first law is a consequence of Newton’s laws. 

Note: the condition “with the Sun being at one focus” is implicit since the above derivation is based on 
choosing a coordinate system whose origin is located at the Sun. We should also note that with some 
minor amendments to the above equations and conditions the case of circular orbit may be included 
as a special case of elliptical orbit corresponding to e = 0 where the orbital equation becomes r = bu 
(which corresponds to A = 0 in Eq. 28). 

3. Show that the perihelion distance rg and the aphelion distance r; are given respectively by:!°° 


B? d l+e 
ro = an rp=n7 
Y= GM Ue) PSD See 
Answer: As seen in problem 2, the equation of the planetary orbit is (see Eq. 29): 
1 


eS me 
Acos@+ Gh 


[29] Tn fact, this is “a solution” and not “the solution” although it is chosen as “the solution” due to physical and practical 
considerations. 

[30] Perihelion (aphelion) is the point in the orbit where the planet is closest to (farthest from) the Sun. Hence, the perihelion 
(aphelion) distance is the minimum (maximum) distance between the planet and the Sun. 
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Now, since A > 0 and S¥ > 0 then r is minimum (i.e. r = ro) when cos ¢ = 1 (since the denominator 
is maximum when cos ¢ = 1) and r is maximum (i.e. r = r,) when cos ¢ = —1 (since the denominator 
is minimum when cos ¢ = —1)./3!! So, if we use another given form of the equation of planetary orbit 
(see Eq. 30), i.e. 
B? B? 
po GMM Be 
AB’ coso +1 ecos¢+1 GM (1+ecos¢) 
then we have: 
B? 
"0 ~ GM (ite) 
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4. From the answer to problem 1, show that Kepler’s second law is a consequence of Newton’s laws. 
Answer: As in the answer of problem 2, we use the system of equations that we developed in problem 
1 (i.e. Eq. 27 to be more specific) to show that Kepler’s second law is a consequence of Newton’s laws 
since that system is obtained from Newton’s laws. Now, the infinitesimal area dA swept by the line 
joining the planet to the Sun in an infinitesimal time interval dt is given by: 


dA = sede 


where d¢ is the infinitesimal polar angle swept during dt. So, if we divide the last equation by dt we 
get: 
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where in line 2 we use r = 1/u, and in line 3 we use Eq. 27. Now, since B is constant then the last 
equation means that the areal velocity of planet is constant (as stated by Kepler’s second law) and 
hence Kepler’s second law is a consequence of Newton’s laws. 

Note: Eq. 32 shows that the areal speed is constant. The constancy of the areal velocity can then be 
concluded from the fact that the orbit is planar as indicated earlier. In fact, we are assuming that the 
original statement of the Kepler’s second law is about areal velocity rather than areal speed; otherwise 
we do not need to show the constancy of direction. 

5. Show that Kepler’s second law is a demonstration of the principle of conservation of angular momentum. 
Answer: As demonstrated in problem 4, Kepler’s second law can be derived from Eq. 27. So, to show 
that Kepler’s second law is a demonstration of the principle of conservation of angular momentum 
we just need to demonstrate that Eq. 27 implies the conservation of angular momentum. Now, if we 
multiply Eq. 27 with m (i.e. the mass of planet) we get: 
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[31] Tn fact, this originates from our choice of the coordinate system where the solution u = Acos¢ + GM is based on the 


condition that ¢ = 0 corresponds to the perihelion position (as stated in problem 1). We should also remark that the 
condition A > 0 (which we stated previously with no justification) can be justified by the fact that r > 0 and hence from 


Ho > 0. Now, since for ellipse 0 < e < 1 then (ecos¢+1) > 0 and hence A must be greater than 


zero if r to be greater than zero. 


Eq. 31 we get 
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where in line 2 we use u = 1/r. Now, according to classical mechanics mr? se is the magnitude of 
the planet angular momentum, and since m and B are constants then the last equation expresses the 
conservation of angular momentum (at least its magnitude). Again, the constancy of direction can be 
concluded from the planar nature of the orbit. 

. From the answer to problem 1, show that Kepler’s third law is a consequence of Newton’s laws. 
Answer: In the answer to the present problem we use a few elementary mathematical facts as well 
as the results that we obtained already in the previous problems which are derived from the system 
of equations that we developed in problem 1, and since that system is ultimately based on Newton’s 
laws then what we will obtain here is a consequence of Newton’s laws. 

It is well known that for ellipse we have: 


b=avV1-e? (33) 


where a is the semi-major axis, } is the semi-minor axis and e is the eccentricity. It is also obvious 


that the sum of the perihelion distance rp and the aphelion distance r; is twice the semi-major axis, 
that is: 
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where line 3 is from the results of problem 3 while the other lines are based on simple algebraic 
manipulation. Now, the area of ellipse is tab, so if the time period of a complete revolution is T then 
the areal speed of planet (which is constant) should be given by: 
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am (36) 
where in line 2 we use Eq. 32, in line 4 we use Eq. 33, and in line 7 we use Eq. 35. Now, since An” ig 


GM 
constant then the last equation means that the square of the planetary time period is proportional to 


the cube of the mean distance (which is represented by a since a is the mean of rp and r; as seen in Eq. 
34). This result is no more than Kepler’s third law and hence this law is a consequence of Newton’s 


laws. 
Note: for circular orbit, the derivation of the Kepler’s third law from Newton’s laws can be obtained 
more easily by equating the magnitude of centripetal acceleration w?r (where w = aE is the angular 


speed and r is the orbital radius) to the magnitude of gravitational acceleration cH , that is: 
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which is the same as Eq. 36 with r= a. 
Exercises 


1. What is the relation between Kepler’s laws and Newton’s law of gravity? 

2. Show that for a binary system whose objects revolve around their center of mass Kepler’s third law 
keeps its form with a standing for the semi-major axis of the ellipse that represents the orbit of one 
object in the rest frame of the other object while M stands for the sum of the masses of the two 
objects. (Hint: you can simplify the analysis by assuming circular motion.) 


1.6 General Relativity versus Classical Gravity 


The classical theory of gravitation is based on considering gravity as a force originating from matter 
(mass) while general relativity is essentially a geometric theory in which gravity is seen as an effect of 
the distortion (or curvature) of spacetime due to the presence and movement of matter and energy. In 
brief, classical gravity is a physical force like electromagnetic force while general relativistic gravity is a 
geometric effect of spacetime distortion. In the following points we briefly compare the two theories: 

e Classical theory of gravity is a physical theory while general relativity is a geometric theory of a physical 
phenomenon. 

e Gravity in classical theory is a physical force while gravity in general relativity is an effect of the 
spacetime curvature. 

e Classical gravity has no temporal dependency and hence its space is the familiar Euclidean 3D space 
while general relativity is a theory of spacetime and hence it belongs to a 4D Riemannian space which 
includes a temporal dimension as well as spatial dimensions. 

e General relativity converges to classical gravity in the classical limit, ie. weak, time independent 
gravitational field at low speed with the source of gravity being restricted to matter (see § 5). 

We should remark that in the above comparison we essentially restrict our attention to general relativity 
as a gravity theory, i.e. we do not consider its status as a “General Theory” with its extended formalism. 
In fact, from this perspective this should make another difference between classical gravity and general 
relativity since general relativity is more general than classical gravity because it claims to provide general 
rules of transformation between reference frames and hence it formulates the laws of physics in an invariant 
form in all types of frame. 


Exercises 
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1. What are the manifolds that underlie the classical theory of gravity and general relativity? 
2. Outline the different philosophical views which the classical theory of gravity and general relativity 
rest upon. 


1.7 General Relativity versus Special Relativity 


In brief, both special relativity and general relativity are theories about spacetime but the spacetime of 
special relativity is void of matter and energy!3?! (and hence it is flat) while the spacetime of general 
relativity is not void (and hence it is curved). In the following points we briefly compare the two theories: 
e Special relativity is a theory about spacetime coordinate transformations (i.e. Lorentz transformations) 
while general relativity is essentially a theory about gravity. However, because gravity in general relativity 
is an effect of the curvature of spacetime due to the distortion caused by the distribution of matter and 
energy, the two theories share the common factor of being theories about spacetime.|*3] 

e While the spacetime in special relativity is flat Minkowskian, the spacetime in general relativity is curved 
Riemannian. 

e General relativity supposedly converges to special relativity in weak and vanishing gravitational fields 
because in these cases the spacetime is approximately flat (or flat). 

e Special relativity applies locally in the spacetime of general relativity (i.e. even in strong gravitational 
fields) since this spacetime is locally flat although it is globally curved. 

Regarding the relation between special relativity and general relativity and if special relativity is con- 
tained in general relativity or they are two different theories, it is not a clear cut issue and hence both 
opinions have followers and supporters. The opinion that special relativity is contained in general relativity 
may be supported by the following: 

(a) The domain and subject matter of the two theories is spacetime where this spacetime is curved in 
general relativity and flat in special relativity. Hence, considering the domain of the two theories, the 
special-general relation is justified since flat space is a special case of curved space. 

(b) Special relativity applies locally in the spacetime of general relativity and hence in this sense it is a 
special case of general relativity and contained in it. Similarly, general relativity supposedly converges to 
special relativity in weak and vanishing gravitational fields and hence special relativity is contained (as a 
special case) in general relativity. 

(c) The invariance principle of general relativity is more general than the invariance principle of special 
relativity (see § 1.8.1). 

(d) According to the equivalence principle (see § 1.8.2), accelerating frames (which are supposedly more 
general than inertial frames of special relativity) are equivalent to gravitational frames which are the 
primary subject of general relativity. 

On the other hand, the opinion that special relativity and general relativity are different theories may 
be supported by the following: 

(A) Special relativity is not contained in general relativity in the sense that the formalism of special 
relativity can be obtained from the formalism of general relativity!®“! like classical mechanics in its relation 
to Lorentz mechanics (see B4). 

(B) The local application of special relativity in the spacetime of general relativity is because this condition 
is imposed rather arbitrarily in general relativity (based on the local flatness of the spacetime of general 


[32] In fact, it is void of matter and energy as source of gravity. 

[33] As indicated earlier and will be discussed further later on, general relativity is also a “General Theory” and not only a 
gravity theory. So, from the “General Theory” perspective it may be seen as a generalization to special relativity due to 
the local application of special relativity in the spacetime of general relativity and hence it is also a theory of spacetime 
from this perspective. 

[34] The formalism of general relativity here refers to the gravitational formalism (as represented by the Field Equation 
and its direct consequences) although the formalism of general relativity supposedly includes the formalism of special 
relativity in its local application. In brief, the gravitational component of the formalism of general relativity (i.e. the 
Field Equation and its consequences) and the Lorentzian (or special relativistic) component of the formalism of general 
relativity (ie. Lorentz transformations and their consequences) are two independent formalisms that are pieced together 
(rather arbitrarily) in general relativity where no one of these formalisms can be obtained from the other (like obtaining 
classical formalism from Lorentzian formalism as a special case which we demonstrated in B4). 
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relativity) and hence it is not a natural convergence of the formalism of the theory itself. In fact, the 
local application of special relativity may qualify special relativity to be the more general theory from this 
perspective. Similarly, the alleged convergence to special relativity in weak and vanishing gravitational 
fields (also based on the approximate flatness and flatness of spacetime in these cases) is a convergence of 
the underlying spacetime and not a convergence of one formalism to the other. Yes, the geometric paradigm 
of general relativity (i.e. gravity is an attribute of spacetime curvature) implies such a convergence but 
it is not the same as the actual convergence of one formalism to the other. Moreover, this requires 
physical evidence to be fully established unlike the convergence of the formalism which is a fact that 
can be established by purely theoretical means. To demonstrate the nature of this alleged convergence 
(whether the convergence locally or the convergence in weak and vanishing fields) we need just to compare 
the convergence of the formalism of general relativity to the formalism of Newtonian gravity in the 
classical limit (see § 5) with the alleged convergence of general relativity to special relativity to see the 
obvious difference between the types of these convergences. We may also consider in this comparison the 
convergence of the formalism of special relativity to the formalism of classical mechanics in the low-speed 
limit. 

(C) Special relativity is about transformation of spacetime coordinates which should apply (when its 
conditions are met) to any physical subject and theory!?*! while general relativity is primarily about 
gravity and hence special relativity is the more general theory from this perspective (see exercise 6). The 
requirement of the strong equivalence principle (see § 1.8.2) which extends the theory of general relativity 
to all physical laws is arbitrary; moreover it requires validation by physical evidence.!°4 


Exercises 


1. Give examples of common features between special relativity and general relativity. 

2. Give examples of features that distinguish special relativity and general relativity from each other. 

3. Compare the fundamental principles which special relativity and general relativity rest upon. 

4, List some of the features (such as foundations, implications and predictions) of special relativity and 

the features of general relativity. 

What “special” and “general” mean with regard to the two relativity theories? Justify these labels. 

6. It is commonly claimed that special relativity is less general than general relativity. Can we claim that 
in a sense special relativity is more general than general relativity? 

7. Assess the relation between special relativity and general relativity and if special relativity is contained 
in general relativity or they are two different theories. 


or 


1.8 General Principles 


In this section we discuss the principles which general relativity is based on. We note that these principles 
are mostly epistemological in nature with scientific implications although some have prime and explicit 
scientific content. 


1.8.1. The Principle of Invariance 


The essence of this principle is that observers in different frames of reference between which certain 
coordinate transformations apply should observe and record the same laws of physics, i.e, the laws of 
physics take the same form regardless of the frame of reference.7! This principle may also be stated as: 


[35] As explained earlier (see § 1.2), special relativity (or rather Lorentz mechanics) should have been extended to include 
even gravity if Lorentz transformations are the right transformations. 

[36] We should note that general relativity (even as a gravity theory) also provides a sort of transformations between local 
frames in a given spacetime (as demonstrated for example by time dilation where the time in one local frame is correlated 
to the time in another local frame). However, these “local transformations” are not for the purpose of transforming the 
laws invariantly between frames. 

[37lIn the literature of general relativity, the principle of invariance is commonly known as the principle of covariance (or 
general covariance). However, we mostly avoid using this term to avoid confusion with covariance of tensors. This 
should be further justified by our intention (in this section at least) to investigate the principle of invariance in general 
whether in general relativity or elsewhere. We note that covariance in this context means form invariance. We should 
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all frames of reference (possibly of a certain category such as inertial) are suitable for formulating the laws 
of physics. For example, in classical physics the laws of mechanics take the same form in all inertial frames 
under the Galilean transformations. This similarly applies to Lorentz mechanics (or special relativity), 
since the domain of this mechanics is also inertial frames, but under the Lorentz transformations. In 
general relativity, this invariance principle is extended to include all frames and observers (although 
usually locally) and hence the “inertial” restriction is lifted. Accordingly, the principle of invariance in 
general relativity is more general than that in classical mechanics and in Lorentz mechanics. 

The justification of the principle of invariance is that objective laws of physics should not depend on 
our conventions and how we choose our coordinate systems and frames of reference. In other words, there 
should be no privileged frame of reference in which the laws of physics take a particular form. Accordingly, 
real physical laws that truly describe the behavior of Nature in an objective and impartial way should be 
expressed in a form that is independent of observers and frames of reference. 

The principle of invariance of physical laws can be characterized by the following: 

1. This principle is an epistemological principle rather than a scientific principle. In other words, it is a 
sort of philosophical convention about what should be considered a valid physical law and what should 
not be considered so. Accordingly, this principle is not a scientific principle that is based on experiment 
and extracted from observation of Nature (such as the principle of energy conservation). Instead, we 
choose a certain set of transformations and we label those rules of Nature that remain invariant under 
these transformations (according to our observations) as laws of physics. 

2. This principle is about the form of physical laws, i.e. any valid physical law should take the same 
form in different frames of reference under certain transformations. In our book B4 we distinguished 
between form invariance and value invariance. Accordingly, the principle of invariance is about form 
invariance and not about value invariance. 

3. This principle requires a domain of validity which determines the type of frames to which this principle 
applies. This domain may be a particular category of reference frames or all reference frames. For 
example, the domain of this principle in classical mechanics and in Lorentz mechanics is inertial frames 
while the domain in general relativity is reference frames in general whether inertial or accelerating or 
gravitational. 

4. This principle requires a set of coordinate transformations under which the laws are required to keep 
their form. For example, in classical mechanics the principle of invariance requires the form of legitimate 
physical laws to be invariant under the Galilean transformations, while in Lorentz mechanics it requires 
this form to be invariant under the Lorentz transformations. In general relativity, it supposedly requires 
the form to be invariant under “general transformations”. 

5. The validity of the invariance principle is restricted to the determination of which rule is a legitimate 
candidate for becoming a physical law and hence the invariance principle has no validity in determining 
the actual physical laws. The latter validity belongs to the available experimental and observational 
evidence in support of the rule. In other words, form invariance is a necessary but not sufficient 
condition for a rule to be a law of physics, i.e. all laws of physics should be form invariant but not all 
form invariant rules should be laws of physics. In fact, this restriction is particularly important in the 
framework of general relativity (and especially in assessing the validity of the principles of equivalence 
and General Covariance) where we will see that the existence of a valid formal transformation does 
not guarantee the validity of the transformation physically in the real world. 


We should remark that from a historical perspective, the principle of invariance started with the laws of 
mechanics which are considered invariant by Galileo and the pioneers of classical mechanics under what 
is subsequently called the Galilean transformations. It was then extended to include electromagnetism 
(and later to all laws of physics) by Poincare and the pioneers of Lorentz mechanics where the Galilean 
transformations were replaced by the Lorentz transformations (with some amendments to the laws of 
mechanics). |°8] The principle was then applied with further elaboration and generalization during the 


also draw the attention of the readers to the existence of a technically-specific Principle of General Covariance which 
will be discussed in § 1.8.2. 

[38] Some historical records seem to indicate that the generalization of the principle of invariance belongs to Maxwell who was 
quoted to talk about “the doctrine of relativity of all phenomena” and “a formula of the same type” (although seemingly 
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development of general relativity and incorporated in its more general form in this theory.!89! So, if we 
include this historical factor in the characterization of the principle of invariance then we can say: the 
principle of invariance is characterized by its domain (e.g. inertial frames), type of transformations (e.g. 
Galilean transformations) and subject (e.g. mechanics). 

Another remark is that since general relativity is essentially a theory of gravity, there is no actual 
coordinate transformations between frames in this theory. Yes, it is claimed in this theory that the 
results and rules of Lorentz mechanics, which are based on the validity of Lorentz transformations, apply 
locally but this is not the same as applying transformations between frames as it is the case in classical 
mechanics with regard to the Galilean transformations or in Lorentz mechanics with regard to the Lorentz 
transformations.!4° This difference arises from the fact that, unlike Lorentz mechanics for example which 
is a theory about coordinate transformations, general relativity is fundamentally a theory of gravity and 
not a theory of coordinate transformations of spacetime although it is based on formulating gravity in 
terms of spacetime and its geometric properties. Accordingly, we may question the general relativistic 
claims about the generality of the invariance principle with regard to frames of reference (i.e. domain) 
and if this is implemented and realized in a meaningful way by the theory. Yes, it is claimed that the 
same laws of physics apply in all frames of reference, and hence the laws should be cast in an invariant 
tensorial form and they are subject to the most general type of transformations, even though we may 
not actually apply coordinate transformations as we do in Lorentz mechanics. In fact, we can question 
even the generality of the invariance principle in general relativity with regard to the laws of physics (i.e. 
subject) because even if we accept the claims that general relativity passed all the gravitational tests (see 
§ 9), there is no concrete evidence for the invariance of all physical laws in gravitational and accelerating 
frames. The reality is that the strong equivalence principle (see § 1.8.2) is not experimentally verified and 
hence the verified gravitational results may establish the weak, but not the strong, equivalence principle. 
In other words, the claimed evidence in support of general relativity is related to the theory as a gravity 
theory and not as a “General Theory”. In brief, as a gravity theory general relativity is not a theory of 
transformation and hence it is not concerned with the invariance of physical laws, while as a “General 
Theory” there is no actual implementation or demonstration or verification of the alleged invariance and 
hence this invariance is a trivial claim with no substantial evidence in its support (and possibly there 
is actual evidence against it as it might be claimed based on the evidence against the general validity 
of the strong equivalence principle). In fact, even the logical essence of the strong equivalence principle 
(as well as the Principle of General Covariance) is questionable. As indicated earlier, the existence of a 
valid formal transformation does not guarantee the validity of the transformation physically because the 
invariance is a necessary but not sufficient condition for determining what should be a law of physics and 
what should not be. 


Problems 


1. Justify the demand for tensor calculus in formulating the laws of physics. 
Answer: This demand may be justified by the requirement of the principle of invariance because 
tensor formulation is invariant under coordinate transformations. So, the equations that represent 
physical laws are guaranteed to be form invariant under coordinate transformations if they are cast in 
tensor form. 

2. What is the implication of the premise that the principle of invariance is an epistemological principle 
rather than a scientific principle? 


without suggesting a specific type of transformations). The invariance of the speed of light (which is related to the 
principle of invariance and the principle of relativity) may also be inferred from his writings and work. In fact, even the 
roots of what we now call special relativity (or rather Lorentz mechanics) can be detected in some of the writings of 
Maxwell. 

[39] Our view (which is based on the available historical evidence) is that the principle of invariance in its general form (i.e. 
the requirement that all physical laws should be form invariant under certain coordinate transformations) was coined 
by the pioneers of Lorentz mechanics (like Poincare) although there seems to be no actual work on implementing or 
demonstrating or verifying this principle outside mechanics and electromagnetism. As we will see, the alleged implemen- 
tation of this principle in general relativity is trivial and hence we do not see any priority to Einstein in generalizing this 
principle. This should also apply to the generalization of this principle with regard to the type of frames. 

[49] Tn fact, Lorentz transformations are not valid globally in general relativity as in special relativity. 
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Answer: The obvious implication is that its value is rather restricted unlike the value of fundamental 
physical principles (e.g. the conservation of energy) which acquire their legitimacy and validity from the 
authority of Nature itself and hence they cannot be violated or modified by our needs or conventions. In 
other words, the principle of invariance determines what type of science we should create and construct 
and this can be susceptible to changes or modifications depending on our needs and conventions which 
are largely materialized in our philosophical and epistemological framework. For example, if we find it 
useful to use non-invariant set of “laws” in a certain frame or category of frames then there is nothing 
that can prevent us from doing this because there is nothing that is fundamentally wrong with this. 
In fact, large parts of physics are cast and used in non-invariant form (inconsistent with the demand 
of general relativity).|41 


Exercises 


1. Discuss the “covariance” label that is attached in general relativity to the principle of “invariance”. 

2. Our assertion that the principle of invariance is an epistemological rather than a scientific principle may 
be challenged by the fact that certain laws at least (e.g. Newton’s second law) are actually invariant 
regardless of any convention. How do we reply? 

3. How is the principle of invariance implemented in general relativity? 

4. Question, briefly, the applicability and generality of the principle of invariance in general relativity and 
if it is really implemented in general under any sort of coordinate transformations. 

5. What is the difference between the invariance principle of special relativity (i.e. the first postulate) 
and the invariance principle of general relativity? 

6. Briefly discuss the historical development of the principle of invariance with regard to the subject of 
this principle. 

7. Lorentz tensor in the frames of 4D spacetime of Lorentz mechanics is the equivalent of the Jacobian 
matrix in spatial coordinate systems (see B4). So, what is the tensor that corresponds to the Lorentz 
tensor (i.e. Jacobian matrix) in general relativity? 

8. Argue against the invariance principle of physical laws and demonstrate its limited value due to its 
epistemological and conventional nature. 

9. Discuss the relation between the principle of relativity and the principle of invariance. 

10. According to the principle of invariance there should be no privileged frame of reference in which the 
laws of physics take a particular form. Discuss this briefly. 


1.8.2 The Principle of Equivalence 


The principle of equivalence is essentially a scientific principle although it has epistemological contents 
and implications as well. In simple terms, the principle of equivalence states that a uniformly accelerated 
reference frame is locally indistinguishable from a frame in a gravitational field, i.e. acceleration in a 
given direction is equivalent in effect to gravitation of suitable magnitude in the opposite direction. This 
is inline with the classical view that acceleration introduces fictitious forces from the perspective of the 
accelerating frame and hence it may be seen as a justification of the general relativistic view that gravity 
is a “fictitious” force due to its equivalence to the effect of acceleration. As we will see, there are different 
forms of this principle in the literature of general relativity. Some of these forms are essentially equivalent 
or they differ in phrasing and presentation but others are fundamentally different, e.g. weak and strong 
forms. The roots of the equivalence principle can be traced back to the days of Galileo with his observation 
of the independence of the gravitational acceleration from the mass of the accelerated object which cannot 
be explained without assuming the equivalence between inertial mass which is related to acceleration and 
gravitational mass which is related to gravity (see questions). So, the idea of a connection or similarity 
between acceleration and gravity is not a novelty or an invention of general relativity. 

The equivalence principle may by stated generically in a more formal way as: “there is no experiment 
conducted in a sufficiently small region of spacetime that can distinguish between a gravitational field and 


[411 As we will see, we distinguish between the invariance of the essence of the laws and the invariance of the form of the 
laws. Accordingly, the above discussion is mainly about the latter. 
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an equivalent uniform acceleration”.!4?! Accordingly, no experiment can distinguish between the effects of 
gravity and the effects of acceleration at local level (where “local” in this context means local in spacetime 
and not only local in space). As we see, there are several limitations in the equivalence principle, e.g. 
“sufficiently small region of spacetime” and “uniform acceleration”. It may also be challenged by claiming 
that the equivalence principle can apply to any force and hence gravity is not special in this regard. For 
example, how can we distinguish between the electric force that is subject to the above restrictions and 
conditions and the effect of acceleration? In other words, what distinguishes gravity from other forces in 
its equivalence to acceleration which eventually leads to stripping the “force” qualification from gravity 
and reducing it to a mere effect of spacetime curvature while keeping this qualification to other types of 
force? The only potential distinction seems to be the association of gravity with mass while other forces 
are associated with other attributes of matter like charge. However, this distinction is virtually circular 
because we are essentially distinguishing gravity by gravity when we take mass into account since gravity 
is an attribute of mass. We note that this challenge to the equivalence principle is mainly directed to 
the weak form of this principle in its general relativistic sense. We should also note that in the literature 
of general relativity there are justifications for the difference between gravity and other forces from this 
perspective, e.g. the acceleration of gravitated object depends on the ratio of gravitational mass to inertial 
mass which is universal while the acceleration of charged object depends on the ratio of electric charge 
to inertial mass which is object-dependent. However, these justifications may be questioned because they 
do not seem to address the fundamental issue raised in this challenge. 

The above criticism to the equivalence principle in its general relativistic sense may be consolidated 
by the existence of field gradients and tidal forces in the gravitational frames but not in the accelerat- 
ing frames. In other words, acceleration may partially account for gravity and its effects but it cannot 
substitute gravity and account for all its effects. The condition of locality may avoid some unwanted con- 
sequences (by virtually annihilating the effect of field gradients and tidal forces) but it does not guarantee 
the equivalence. Moreover, even the locality is conditioned by the limitations on resolution in any real 
physical experiment (i.e. the actual resolution may not eliminate the effects of field gradients and tidal 
forces locally) and this should be specially obvious in strongly non-uniform gravitational fields.!43] There- 
fore, we may conclude that gravity and acceleration are fundamentally different although the similarity 
between their effects, as embedded and expressed in the equivalence principle of general relativity under 
certain conditions like locality, may be used to formulate an approximate or phenomenological theory of 
gravity. However, if this conclusion is valid in principle then it is possible to run the risk of ill-formulating 
and misinterpreting gravity and drawing illegitimate consequences and conclusions from such formulation. 

It is noteworthy that some relativists believe that the equivalence principle is completely wrong or 
at least redundant and hence this principle provides motivation and starting point (rather than valid 
theoretical foundation) for the theory of general relativity (see for example Synge in the References). As 
we will see, the rejection of the equivalence principle with the acceptance of general relativity may be 
logically consistent with regard to certain forms of the equivalence principle (e.g. the weak form which 
essentially underlies the gravity theory) but not with regard to other forms (e.g. the strong form which 
essentially underlies the “General Theory”). In fact, the topic of the equivalence principle, its classification, 
definition and interpretation is very messy in the literature (although this is a characteristic feature of the 
relativity theories in general). This mess also extends to the invariance (or covariance) principle due in 
part to the close relation between the two principles. The mess also includes the relation between these 
principles (which may be combined into or linked by the Principle of General Covariance) as well as their 
relation to the “principle of relativity”. 

Regarding the different forms of the principle of equivalence in general relativity, we can distinguish two 


[42] We use terms like “acceleration equivalent or corresponding to gravity” or “gravity equivalent or corresponding to ac- 
celeration” to indicate the aforementioned meaning (i.e. suitable magnitude in the opposite direction) with the locality 
condition being observed. 

[43] Tm fact, the limitations on the locality are not restricted to the observational limits on resolution but they extend to 
the intrinsic limitations on the phenomenon itself because as the local region shrinks other effects (e.g. microscopic and 
quantum) will get involved and this should invalidate the equivalence principle in its conventional interpretation. In 
such cases, a different theory (or a modified theory such as “quantum general relativity”) should be sought where the 
conventional equivalence principle can (at the best) play only a part in the physical formulation. 
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main forms: weak and strong.!44] These forms may be stated as follows: 

e Weak equivalence principle: in a sufficiently confined region of spacetime, the mechanical (or kine- 

matical) effects caused by gravity cannot be experimentally distinguished from the mechanical effects of 

corresponding uniform acceleration. 

e Strong equivalence principle: in a sufficiently confined region of spacetime, the physical effects caused 

by gravity cannot be experimentally distinguished from the physical effects of corresponding uniform 

acceleration. In other words, the physical laws in confined gravitational frames are the same as the 
physical laws in the corresponding accelerating frames.|4°] 

So, what distinguishes the two forms of the equivalence principle is that the weak form is about the 

mechanical effects of acceleration and gravity (where these effects may be demonstrated for example by 

motion) while the strong form is about any physical effect and hence it is more general than the weak. 

This should explain why they are “weak” and “strong”. 

In the following points we discuss some of the implications and consequences of these two forms: 

1. The weak equivalence principle is essentially about gravitational free fall and hence it may be called 
the principle of universality of free fall. 

2. Although the weak equivalence principle is similar to the classical equivalence principle, they are not 
identical (as might be suggested in the literature). Our view is that the essence of the classical equiv- 
alence principle is the equivalence (or equality or identicality) between inertial mass and gravitational 
mass and this is not the same as the weak equivalence principle of general relativity although the 
classical form is embedded (in a sense) in the weak form of general relativity and could lead to the 
experimental consequences of the weak form (see questions). In fact, the weak form contains a the- 
oretical or epistemological component (which is experimentally unverifiable) that is not contained in 
the classical form. So, while the classical form is a verifiable experimental fact the weak form (as such 
and in its entirety) is not.|4¢ 

3. The equivalence principle (especially the weak form) is based on the assumption that the gravitated 
object does not modify the gravitational field created by the gravitating object tangibly, i.e. the 
gravitated object is a test particle. Otherwise, the gravitational field will be more complicated for 
the equivalence principle to apply since there is no such a simple acceleration that can simulate the 
effect of the combined gravitational fields. In fact, this should put more question marks on the validity 
of the equivalence principle since there are situations in which no acceleration can be equivalent to 
gravity. This should be added to non-uniform acceleration which has no physically-viable gravitational 
equivalent. Also, decelerating frames and rotating frames (which are accelerating frames in the general 
sense of acceleration) have no physically-viable gravitational equivalent.!47! In brief, even if we assume 
that the equivalence between acceleration and gravity is valid in some cases, it is not valid in general 
and this should degrade the equivalence principle and diminish its value and significance (and even its 
validity as a general physical principle) especially with regard to its objective of extending the laws 
of physics to all types of frame (noting that the strong equivalence principle underlies the “General 
Theory”). 

4. If we couple the strong equivalence principle with the premise that the spacetime of general relativity is 


[44] We should note that we have a third main form of the equivalence principle which is the classical form (i.e. the equivalence 
between inertial mass and gravitational mass). This third from is not included here because it is not specifically general 
relativistic since it essentially belongs to classical physics although it represents the root of the general relativistic forms 
of the equivalence principle. 

[45] As we will see, the strong form of the equivalence principle also contains generalizations with regard to the local application 
of the rules of special relativity and hence the physical laws as represented by these rules equally apply in all types of 
reference frame locally. In fact, this should be implied by the locality condition in Riemannian space (which is locally 
flat) plus the alleged validity of special relativity in flat spacetime in general (whether global or local). This validity 
should be based on the fundamental geometric doctrine of general relativity which correlates the physical laws to the 
geometry of spacetime. 

[46] In our view, the distinction between the classical and weak forms should also consider the difference in the nature of the 
relation between the inertial mass and gravitational mass (i.e. the difference between “identical” and “equivalent”) as will 
be discussed in the questions. 

[471 In fact, if we have to accept the distinction between various types of mass (as it is the case in general relativity), then 
even the equivalence between inertial accelerating mass and inertial rotating mass can be questioned. 
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locally flat (and hence the laws of special relativity apply locally in the spacetime of general relativity), 
then we may claim that the strong form is more than an “equivalence” principle between gravity and 
acceleration but it contains an extra claim (i.e. the equivalence of all frames in the local application of 
special relativity) that has no counterpart in the weak form as stated above. 48] If so, then the essence of 
the strong equivalence principle is that all the laws of physics apply in all frames of reference (whether 
inertial, accelerating or gravitational) where these frames coordinate locally flat patches of spacetime 
and hence they are subject to the rules of special relativity at local level (noting that any non-special- 
relativistic effect introduced by gravity or/and acceleration should be accounted for automatically by 
the coordinate transformations between the locally inertial frames and the corresponding non-inertial 
frames). Accordingly, the equivalence principle in its strong form contains an element of invariance.!49! 
As we will see, the strong equivalence principle is what elevates general relativity from the state of 
being a mere gravity theory to the state of being a “General Theory”. 


Problems 


1. Briefly discuss the history of the experimental investigations of the equivalence principle. 

Answer: As stated in the text, the roots of the equivalence principle can be traced back to the days 
of Galileo with his alleged Leaning Tower of Pisa experiments or his rolling balls experiments. In fact, 
some historical records trace the roots of this principle (as represented by the independence of grav- 
itational acceleration from mass and composition) to the Byzantine scholar John Philoponus in the 
6” century AD. There are also some historical records of experiments similar to the alleged Leaning 
Tower experiments of Galileo conducted (prior to Galileo) by the Dutch scholar Simon Stevin around 
1580s. Similar experiments have also been attributed to Newton where he used pendulums of equal 
length with bobs made of different masses and materials and observed that they swing with identical 
speeds and periods. There are also reports that the German physicist Friedrich Bessel conducted sim- 
ilar experiments to those of Newton around 1830s using pendulums where he confirmed the previous 
results. The Hungarian physicist Lorand Eotvos conducted a series of highly precise experiments in 
the late 19” century and early 20°” century using torsion balance techniques and verified the equiva- 
lence between inertial mass and gravitational mass to unprecedented level of accuracy. More accurate 
experiments were conducted recently (e.g. the Eot-Wash group series of gravitational experiments in 
the late 20” century and early 21%’ century) all of which verified the equivalence between inertial 
mass and gravitational mass to very high precision and hence confirmed the equivalence principle. We 
should remark that some of the aforementioned experiments included testing potential variations and 
correlations not only with mass but even with other factors such as the type of materials used and the 
location on the Earth or space. Also, some of those experiments are based on celestial (rather than 
terrestrial) tests and techniques. 

It is very important to note that these investigations are related (at least primarily) to the classical 
equivalence principle and hence they are actually verifications to the classical form rather than the 
weak or strong forms (despite the common claim among general relativists that these experiments are 
verification and evidence in support of general relativity and its equivalence principle; see § 9.12 and 
§ 10.1.1). 

2. Show formally, using classical concepts and formalism and employing the classical equivalence principle, 
that the gravitational acceleration is independent of the mass of the gravitated object. How can this 
explain the fact that the roots of the equivalence principle are embedded in classical physics? 
Answer: If we symbolize the inertial mass with m,; and the gravitational mass with m, then from 
Newton’s second law we have m; = £ where f and a symbolize the magnitude of force and acceleration, 


while from Newton’s law of gravity we have m, = pL where d is the distance from the Earth center, 
G is the gravitational constant and M is the mass of the Earth. So, if these two types of mass are 


[48] We deliberately phrased the strong equivalence principle above in a simple and rather non-comprehensive form to highlight 
the correspondence between the weak and strong forms. 

[49] This should be inline with the Principle of General Covariance (and similar interpretations) which will be investigated 
later. 
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equivalent (i.e. m; = m,)®°l then we should have: 
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which justifies the fact that the “gravitational acceleration” g of the Earth is independent of the mass 
of the gravitated object. In fact, the gravitational acceleration of the Earth is just an example and 
hence the result is general. 
As we see, the independence of gravitational acceleration from the mass of the gravitated object is 
based on the equivalence between inertial mass and gravitational mass and this explains the fact that 
the roots of the equivalence principle are embedded in classical physics. 

3. Discuss briefly the issue of the relation between the principle of invariance (or general covariance) and 
the principle of equivalence. 
Answer: This very important issue is not clear cut and hence we can find various opinions and inter- 
pretations in the literature not only about the relation between these two principles but even about 
the meaning and significance of each of these principles individually (where the proposed meaning 
should affect the supposed relation). We expressed our opinion about the meaning of these principles 
and the relation between them in several places in this book (earlier and later) and hence we do not 
need to repeat. Regarding the opinions of other authors, we cannot go through these in detail but it is 
important to highlight a particularly significant opinion which essentially unifies (or combines or links) 
the two principles and expresses this in a technically-specific principle which is labeled as the Principle 
of General Covariance. The following quote from Weinberg (see References) eloquently represents this 
view: 
This method is based on an alternative version of the Principle of Equivalence, known as the Principle 
of General Covariance. It states that a physical equation holds in a general gravitational field, if two 
conditions are met: 
1. The equation holds in the absence of gravitation; that is, it agrees with the laws of special relativ- 
ity when the metric tensor gjg equals the Minkowski tensor 7, and when the affine connection T'3, 
vanishes. 
2. The equation is generally covariant; that is, it preserves its form under a general coordinate trans- 
formation x > x’. 
To see that the Principle of General Covariance follows from the Principle of Equivalence, let us sup- 
pose that we are in an arbitrary gravitational field, and consider any equation that satisfies the two 
above conditions. From (2), we learn that the equation will be true in all coordinate systems if it is 
true in any one coordinate system. But at any given point there is a class of coordinate systems, the 
locally inertial systems, in which the effects of gravitation are absent. Condition (1) then tells us that 
our equation holds in these systems, and hence in all other coordinate systems. (End of quote) 
The implications of this particular Principle of General Covariance will be discussed later in the text 
and questions (see for instance § 10.1.5) where we will use the capitally-initialized italic form “Principle 
of General Covariance” to refer to this technically-specific meaning of this principle (which combines 
the invariance and equivalence principles). 

4. Make a brief comparison between the weak and strong forms of the equivalence principle of general 
relativity. 
Answer: We note the following: 
e The weak form is about the mechanical effects (of acceleration and gravity) specifically while the 
strong form is about the physical effects in general. 
e The weak form underlies the gravity theory while the strong form underlies the “General Theory”. 


Exercises 


1. Discuss the following quote which is attributed to Einstein: “The equality of these two masses |i.e. 
inertial mass and gravitational mass], so differently defined, is a fact which is confirmed by experiments 


[50] Th fact, what is needed is proportionality between these two types of mass, the equivalence then follows by choosing a 
proper unit system in which the two become equal. 
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19. 
20. 
21. 
22. 
23. 
24. 
25. 


26. 


of very high accuracy (experiments of Eotvos), and classical mechanics offers no explanation for this 
equality. It is, however, clear that science is fully justified in assigning such a numerical equality only 
after this numerical equality is reduced to an equality of the real nature of the two concepts”. 


. Distinguish between the different forms of the principle of equivalence and the scientific evidence in 


their support. 

What is the difference between the weak and strong forms of the equivalence principle of general 
relativity? 

Discuss briefly the importance of the strong equivalence principle for the theoretical framework of 
general relativity. 

Discuss the significance of the weak and strong forms of the equivalence principle and their relation to 
the nature of general relativity. 

Summarize your main observations about the strong equivalence principle. 

The equivalence principle of general relativity may be seen as a bridge between geometry and physics. 
How? 

The equivalence principle of general relativity may be seen as having two sides (or meanings): static 
and dynamic. How? 

Interpret the equivalence principle of general relativity in the light of the locality of frames and hence 
determine the role of general relativity and the procedure that should be followed in its application. 


. Examine the significance of the locality restriction in the statement of the equivalence principle. 
. Analyze the condition “sufficiently small region of spacetime” in the statement of the equivalence 


principle. 


. Analyze a freely falling frame in a gravitational field. 
. Show formally, using the classical equivalence principle (and assuming locality), that a freely falling 


frame in a gravitational field is effectively inertial (ie. a frame in which Newton’s laws of motion are 


valid). 


. Analyze a frame that is uniformly accelerating in the absence of any gravitational field. 
. Discuss and assess the equivalence principle in the context of free fall situation. 
. Derive a formula for the gravitational field gradient (which estimates tidal forces) along the radial 


direction in the neighborhood of a gravitating object using the classical formalism of gravity (see § 
1.5). 

Calculate the tidal force f; (i.e. force per unit mass)!>!! along the radial direction over a distance d = 1 
nm in the vicinity of the Earth (a) at its surface and (b) at distance 2R from its surface (where R is 
its radius). Comment on the results. 

Estimate the tidal force f; (ie. force per unit mass) along the radial direction over a distance d = 1 
nm in the vicinity of a black hole of 1 Earth mass (a) at the event horizon!®?! and (b) at distance 2Rg 
(where Rg is its Schwarzschild radius) from the event horizon. Comment on the results. 

Assess the validity of the equivalence principle (and hence the validity of general relativity which is 
based on this principle) in very strong gravitational fields. 

Are tidal forces restricted to the radial direction? 

Make a brief assessment to the weak and strong forms of the equivalence principle. 

Use the existence of tidal forces in gravitational systems as a basis for a potentially fundamental 
challenge to the strong equivalence principle. 

Discuss the issue of decelerating frames (as a subset of accelerating frames in its general sense) and 
to which gravity they should correspond according to the equivalence principle. Also, discuss the 
implication of this on the existence of absolute frame. 

To what type of gravitation rotating frames correspond? 

Try to challenge our criticism to the equivalence principle for its failure to include decelerating and 
rotating frames (as well as non-uniformly accelerating of all types) and assess this challenge. 

What is the relation between the invariance principle and the equivalence principle in general relativity? 


[51] What is actually required is its magnitude with the sign indicating its nature. 
[52] The event horizon of a black hole is an imaginary sphere whose center is the singularity (or center) of the black hole and 


whose radius is the Schwarzschild radius (Rg = 26M) of the black hole (see § 8.7). 
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27. Outline the technical meaning of the Principle of General Covariance. What is the procedure for 
implementing this Principle and what is the logic behind this procedure? 

28. Assess the Principle of General Covariance. 

29. Justify the following statement using the invariance and equivalence principles: “physical objects that 
are not under the influence of non-gravitational forces follow geodesic trajectories”. 

30. Identify the potentially different types of invariance as embedded in the general principles of general 
relativity. 

31. Discuss how the principle of relativity in special relativity is extended by the equivalence principle to 
the “principle of relativity” in general relativity. 3! 


1.8.3. The Principle of Correspondence 


The essence of the principle of correspondencel®4! is that any new theory of physics should agree with the 
predictions of the corresponding classical theory in its domain of validity. For example, the predictions of 
quantum mechanics should agree with the predictions of classical mechanics in the limit of large quantum 
number where classical predictions are verified. Similarly, the predictions of Lorentz mechanics (or special 
relativity) should agree with the predictions of classical mechanics in the limit of low speed where the 
classical predictions are verified (see B4). This also applies to general relativity where its predictions (as 
a gravity theory) should agree with the verified predictions of Newtonian gravity in the classical limit (see 
§ 5). 

The validity of the correspondence principle should be obvious because no correct theory should contra- 
dict the experimentally verified predictions and results of classical physics. It should also be obvious that 
the principle of correspondence is an epistemological principle. In fact, it is no more than an expression 
of the principles of reality and truth, i.e. uniqueness of reality and truth (refer to B4). As we will see in 
§ 3 and § 5, the formalism of general relativity is partly based on the principle of correspondence where 
the required convergence to the classical limit is assumed to find certain parameters in the formulation of 
general relativity. Moreover, it is shown that the formalism of general relativity converges to the formalism 
of classical gravity in certain classical cases. 


Exercises 


1. Define and discuss the correspondence principle. 

2. Is the correspondence principle restricted to physics? 

3. Discuss the applicability of the correspondence principle in the alleged convergence of general relativity 
to special relativity at local level. 


1.8.4 Other Principles 


It is important to note that the principles that we investigated in the last three subsections represent 
the formally nominated and commonly recognized principles; otherwise the theory of general relativity 
relies on other principles as well. For example, we can nominate “the principle of metric gravity” (which 
we may also call “the principle of geometric gravity” or “the principle of curvature”) whose essence is 
the representation of gravity by the geometry of spacetime and its curvature which is determined by 
the mass-energy distribution. We may also nominate “the principle of geodesic motion” whose essence is 
the geodesic nature of the spacetime trajectories of free objects (although this principle may be derived 
from the formalism of the theory and may even be implied by the former principle), [55] In fact, general 
relativity (like any other scientific theory) relies on many other general principles such as the principle of 


[53] Tn fact, this question is about the relation between the principle of relativity and the general principles of general 
relativity. The restriction to special relativity is because there is no such explicit “principle of relativity” in general 
relativity (although its essence is embedded in the framework of the theory). 

[54] This may also be called the consistency principle to be distinguished from the correspondence principle of quantum 
mechanics, i.e. the consistency principle is a generalization of the correspondence principle of quantum mechanics. 

[55] We note that the principle of metric gravity is essentially epistemological while the principle of geodesic motion is 
essentially physical. 
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causality (i.e. any effect has a cause whose existence precedes the existence of the effect). However, we 
do not go through these principles in detail due to their simplicity or/and generality (and hence they do 
not require elaborate explanation or special attention as general relativistic principles). 


1.9 Criteria of Scientific Theory 


For any theory to be qualified as an acceptable scientific theory it should comply with the following 
criteria: 

(a) It should be consistent with the rules of logic because logic is the most fundamental component of any 
rational intellectual product. 

(b) It should be consistent with the principles of reality and truth (i.e. the existence and uniqueness of 
reality and the uniqueness of truth) because otherwise the theory will defeat itself. 

(c) It should be entirely about the physical world because science is about this world. 

(d) It should be consistent with other facts and verified theories. 

(e) It should be practical to use and apply because science (unlike philosophy and mathematics for 
example) is about physical reality and hence it should have a practical (as well as theoretical) value and 
significance. An important instance of the practicality criterion is that it should be possible to verify and 
falsify the theory, i.e. it can be proved to be right or wrong experimentally. 


Problems 


1. What criterion (c) in the above list actually means? 
Answer: This criterion does not only require that the subject of the theory should be a natural 
phenomenon that belongs to the physical world but it also requires that the theory should not involve 
any non-physical element, entity or concept in its framework and therefore the theory should not 
include any metaphysical or supernatural element or component. 

2. Link criterion (d) in the above list to another criterion in the list. Also, link criterion (d) to the 
principles that we investigated in § 1.8. 
Answer: Criterion (d) is obviously based on criterion (b) as a requirement for the uniqueness of truth. 
Criterion (d) is also linked to the principle of correspondence which was investigated in § 1.8.3. 


Exercises 


1. Can a creation theory be a scientific theory? 

2. Assess general relativity in the light of the above criteria. 

3. Give broad criteria for ideal scientific theory and assess the theories of modern physics from this 
perspective. 

4. Clarify criterion (e) in the list that we stated in the text giving some examples. 

5. Are the criteria for acceptable scientific theory (as stated in points a—e in the text) sufficient or/and 
necessary conditions (or not) for the truthfulness of a presumed scientific theory? 

6. Why metaphysical entities cannot enter in the formulation of a physical theory? 

7. Give more justification to the requirement of consistency with logic (i.e. criterion a). 


1.10 Reading Too Much in the Equations 


In this section we discuss the validity and value of mathematical models and why observations and exper- 
iments should lead the mathematical development of science rather than the opposite. Many equations 
and mathematical models that are developed to describe physical phenomena have limited applicability 
although from a formal perspective they do not look restricted. Despite the fact that mathematical equa- 
tions are legitimate and necessary tools for describing physical phenomena, the two are not equal. This 
means that the mathematical models are not equivalent to the physical phenomena and hence although 
they agree in general they do not necessarily agree in every aspect and detail. For example, the math- 
ematical models may be based on certain assumptions, some of which can be implicit or hidden, and 
hence their extension and generalization cannot be justified. Moreover, mathematics is a highly abstract, 
symbolic and rather simplistic tool while the physical phenomena are usually very complex and messy. 
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Hence, it is very unlikely that a mathematical model can be so comprehensive that it captures and includes 
all the physical factors involved in a given phenomenon and this is particularly true in highly complex 
physical phenomena like gravity and highly abstract scientific theories like general relativity. In brief, to 
construct realistic scientific theories that reflect the reality of the physical world the observations, rather 
than mathematical models and modeling, should lead the scientific research or at least the observations 
should go hand in hand with the mathematical modeling and theoretical reasoning. 

The conclusion from this preamble is that the reality and truthfulness of many of the claimed conse- 
quences and implications of general relativity that are based on analyzing the formalism of this theory 
are questionable. This is particularly true with respect to modern cosmology which is largely guided by 
the theoretical framework and formalism of general relativity. This also applies to many astrophysical 
theories and speculations that are based entirely on the formalism of general relativity and hence they 
represent mathematical and theoretical curiosities rather than physical reality. Accordingly, many of the 
consequences of general relativity that have been inferred from purely theoretical models are questionable. 
Typical examples of this sort of questionable issues are the contemplative details about black holes and 
their physics (see § 8.7) even if we agree on the possibility of the existence of black holes and even if this 
existence is supported (or will be supported) by genuine scientific evidence from astronomical observa- 
tions. In brief, equations (which represent mathematical models and modeling) in science should be read 
and analyzed wisely and sensibly in association with observation, rationality and proper epistemology. 5° 


Problems 


1. Define singularity and distinguish between mathematical singularity and physical singularity. Also, 
discuss the relation between the two. 
Answer: We may define singularity roughly as a situation or condition that leads to the divergence 
of a given quantity and hence it becomes infinite at certain point or region of space. Singularity is also 
used to mean the point in space where the singularity occurs. 
We distinguish between mathematical singularity and physical singularity by considering the former 
as a mathematical requirement obtained from a valid mathematical formulation that leads to the 
singularity while considering the latter as a physical realization of this mathematical requirement 
which means that a given physical property goes to infinity in the real world. For example, we may 
infer from a valid mathematical formulation that the density (or curvature) at the center of black holes 
is infinite and hence we have a mathematical singularity. We may then conclude that the density of 
the actual black holes is infinite (ie. they represent a real mass concentration at a geometric point in 
space with zero volume) and hence we have a physical singularity. 
We believe that mathematical singularity should not mean or imply physical singularity. The reason 
is that real world has no infinite quantities and hence no singularity should exist in reality. The 
truth is that such quantities do exist only in theoretical and hypothetical worlds, e.g. the abstract 
spaces of mathematics. In fact, this is one of the widespread examples of “Reading Too Much in the 
Equations” where it is common in modern physics to conclude physical singularities from mathematical 
singularities. The undisputed existence of accidental (or coordinate) singularity is a clear evidence for 
the misleading ability of mathematics when it applies to physical phenomena since the accidental 
singularity is a mathematical singularity with no corresponding physical singularity because it can 
be removed by replacing the coordinate system. In fact, the conclusion of physical singularities from 
mathematical singularities may be challenged more formally and technically by denying the validity 
of the singularity even mathematically (especially when mathematics is used to describe and quantify 
the real world) because any valid mathematical formulation should exclude such singular points and 
conditions and hence we do not have legitimate and valid mathematical singularities to conclude 
physical singularities from them. 


[56] Unfortunately, the majority of modern physicists (unlike classical physicists) are poor in logic, poor in epistemology and 
poor in philosophy of science and they are indoctrinated to follow the equations literally and make the mathematical 
models lead the observations and that is why modern physics is full of absurdities and mythical objects such as wormholes, 
time machines, strings, membranes, dark matter, dark energy, etc. In fact, we propose including the above subjects (i.e. 
logic, epistemology and philosophy of science) in the curriculum of science education (at least starting from college or 
undergraduate level if not earlier). 
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Note: the reader should note that we use “physical singularity” in a different meaning to its common use 
in the literature. In fact, we use accidental singularity for coordinate singularity, essential singularity 
for what is called by some authors physical singularity, and we use physical singularity to mean a 
singularity in the real physical world. 

Exercises 


1. Give some reasons for our rejection of over-mathematization of science. 


Chapter 2 
Mathematical Background 


In this chapter we summarize the main mathematical concepts and techniques that we need for the 
presentation and formulation of general relativity in a rigorous mathematical form. As we will see, these 
concepts and techniques are mostly imported from tensor calculus and differential geometry of abstract 
multi-dimensional manifolds. The reader, however, should note that the present chapter includes the 
minimum of the most important parts of the required mathematical background and hence many aspects 
of the mathematical background that is needed in the formulation of general relativity are not discussed 
in this chapter (at least in sufficient details) due to the restriction on the size of the book and because 
most of these aspects are treated in my previous books. The reader should therefore consult those books 
(or other books such as those in the References) for the missing materials and details about differential 
geometry, tensor calculus and Lorentz mechanics (see the References in the back of this book). However, 
the reader should note that some of the missing details and supplementary materials are included in the 
exercise book that accompanies this book. 

It is noteworthy that because the materials in this chapter are mostly about abstract spaces in general 
regardless of dimensionality we do not strictly follow the convention about the Latin indices (see § 1.4). 
In fact, we generally use Latin indices in our mathematical formulations even for non-3D spaces unless the 
subject or context dictates otherwise and hence the use of Latin indices in this chapter does not necessarily 
mean that the given formulation is restricted to 3D spaces. This may also extend to some other symbols 
(like ds which is primarily used for 3D spaces) where in certain contexts they are assumed to belong to an 
nD space and hence they are not specific to a space of certain dimensionality. We should also note that 
apart from general mathematical and physical terminology which should be known to the readers at this 
level, the basic terminology in this chapter is mostly explained in the previous chapter while some other 
terminology is explained within its proper context. 


2.1 Space, Coordinate System and Transformation 


In mathematics, space or manifold is an abstract concept whose origin comes from the physical space that 
we live in. So, we can imagine mathematical space as a container of abstract mathematical objects. The 
main property of space is the number of its dimensions (or dimensionality). So, a space with 1 dimension 
is a 1D space, a space with 2 dimensions is a 2D space, and so on. The space can be regarded as an 
assembly of points and hence we need an abstract device to identify and distinguish these points; this 
device is called “coordinate system” which will be investigated later in this section. 

A space is described as Euclidean when it can be coordinated by a Cartesian coordinate system’! and 
described as flat when it can be coordinated by a Cartesian or quasi-Cartesian system (see Problems). If 
the space cannot be coordinated by a Cartesian or quasi-Cartesian system it is described as curved. A Rie- 
mannian space is a space characterized by having a symmetric rank-2 tensor called the metric tensor that 
represents the geometry of the space. Riemannian space may also be characterized (and hence restricted) 
by the condition that it is possible to construct a locally Cartesian coordinate system at any of its points, 
i.e. it is locally Euclidean or flat./58] Riemannian space may similarly be characterized by having a line 
element ds whose square is defined by the quadratic form (ds)? = Gi; da'dx) where gi; is the metric tensor, 


[57] This is equivalent to the condition that there is a permissible coordinate transformation that takes the general quadratic 
form (ds)? = g;j;dax‘dx) of the space to the Cartesian quadratic form (ds)? = dv, (dz*)?. In other words, there is a 
permissible coordinate transformation that takes the metric tensor g;; to the Kronecker delta tensor 6;;. 

[58] We note that there are some abstract metric spaces which are not Euclidean or flat even locally and accordingly they 
are not Riemannian. 
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dx‘ and dzx/ are coordinate differentials and i and j range over the space dimensions.!°9! Conventionally, 


the quadratic form (which is based on the metric tensor) of Riemannian space is positive definite. Hence, 
a space whose characteristics are similar to the characteristics of Riemannian space but its quadratic form 
is not positive definite (i.e. it is Riemannian apart from this mathematical nicety) is commonly known 
as pseudo- or quasi-Riemannian. As indicated earlier, physicists generally do not discriminate between 
Riemannian and pseudo-Riemannian and hence both might be described as Riemannian. 

Coordinate system of a given space is a mathematical device used to identify the location of points and 
objects in the space. Coordinate systems are also needed to define non-scalar quantities (mainly tensors) 
in a specific form and identify their components in reference to the basis vector set of the system. This 
applies in particular to the metric tensor (which represents the geometry of the space) where a coordinate 
system is needed to obtain a specific form of this exceptionally important tensor. An nD space requires a 
coordinate system with n mutually independent coordinates to be fully determined so that any point in 
the space can be uniquely identified by n independent coordinates. Coordinate systems are characterized 
by having coordinate curves and coordinate surfaces. Coordinate curve is a curve along which exactly one 
coordinate varies while the other coordinates are held constant, whereas coordinate surface is a surface 
over which exactly one coordinate is held constant while the other coordinates vary. So, at any point in 
the space we have n independent coordinate curves and n independent coordinate surfaces. 

The n coordinate curves uniquely identify a set of n mutually independent covariant basis vectors E; 
(¢ = 1,--- ,n) which are defined as the tangents to the coordinate curves. Similarly, the n coordinate 
surfaces uniquely identify a set of n mutually independent contravariant basis vectors E’ (i = 1,--- ,n) 
which are defined as the gradients of the coordinate surfaces (or rather gradients of coordinates and hence 
they are perpendicular to the coordinate surfaces). Mathematically, the covariant and contravariant basis 
vectors are given by: 


Or i j 
where r is the position vector (of the point on the i” coordinate curve) in Cartesian coordinates (21,...,2n) 


V is the nabla operator in Cartesian coordinates (of the point on the i*” coordinate surface), and x’ rep- 


resents general coordinates.!©°l Accordingly, the covariant and contravariant basis vectors are generally 
variable functions of coordinates. A coordinate system whose all basis vectors (whether covariant or con- 
travariant) are constant throughout the space is described as rectilinear or affine; otherwise it is described 
as curvilinear. A coordinate system (whether rectilinear or curvilinear) is described as orthogonal if the 
basis vectors (whether covariant or contravariant) are mutually perpendicular at every point in the space. 

In general terms, a transformation from an nD space to another nD space!®!! is a correlation that 
maps the points of the first space to the points of the second space where each point in the first and second 
spaces is identified by n independent coordinates. To distinguish between the two sets of coordinates in 
the two spaces, the coordinates of the points in the second space may be notated with primed symbols 
like (x’',...,2’"), while the coordinates of the points in the first space are notated with unprimed similar 
symbols like (x',...,2”). Barred symbols like (z!,...,Z”) may also be used instead of primed symbols. 
For the transformation to be useful and versatile, it should be bijective and hence an inverse transformation 
from the second space to the first space is also defined, i.e. we have direct transformation from first space 
to second space and inverse transformation from second space to first space. 

In many cases (especially in the theories of relativity), the transformation is not between different 
spaces but it is between different coordinate systems of the same space. However, this does not introduce 


[591 To be more clear, (ds)? is the square of the line element ds while gigdx' dad is the quadratic from (since it is quadratic 
in dz) although it is common to call (ds)? the quadratic form (referring to its mathematical expression gijda'das). 

[69] Cartesian in this definition can be local and hence it applies to curved spaces as well as to flat spaces noting that the 
spaces are presumably Riemannian and hence they can be coordinated locally (at least) by a Cartesian system. It should 
be noted in this context that the basis vectors actually belong to the tangent (or cotangent) space (where they represent 
a link between the Riemannian space at a given point and the tangent space at that point). The reader should be aware 
of the fact that the tangent space at any point of a Riemannian (or pseudo-Riemannian) space is a Euclidean (or flat) 
space of the same dimensionality that represents the local geometry of the space. The existence of the tangent space is 
guaranteed by the Riemannian nature of the space since Riemannian space is locally flat. 

[61] Transformations can also be between spaces of different dimensionality, but this is of no interest to us. 
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a fundamental difference on the aforementioned facts and procedures and hence it could be seen as a 
mere conceptual difference. Accordingly, in this book we generally talk about transformations between 
coordinate systems (or reference frames which are the coordinate systems of spacetime) rather than 
transformations between spaces. 

Mathematically, each one of the direct and inverse transformations can be regarded as a mathematical 
correlation expressed by a set of equations in which each coordinate in one system is considered a function 
of the coordinates in the other system. Hence, the transformations between the two sets of coordinates in 
the two systems can be expressed mathematically in a generic form by the following two sets of independent 
relations: 


or: ee east”) (39) 
C= Bees) (40) 
where 7 = 1,...,n and n is the space dimension. The first equation represents the direct transformation 


while the second equation represents the inverse transformation. 
Problems 


1. What we mean by Cartesian and quasi-Cartesian coordinate systems? 
Answer: We generally mean by Cartesian orthonormal rectilinear coordinate system of nD space with 
a positive definite quadratic form. So, all the coordinate curves of the system are straight lines and 
all the coordinate surfaces of the system are planes. Moreover, the coordinate curves (as well as the 
coordinate surfaces and the basis vectors) of the system are mutually perpendicular and the axes have 
identical length scale. An obvious example of Cartesian system is the familiar (x,y,z) system of 3D 
Euclidean space (which is the space of classical physics and Newtonian gravity). 
Quasi-Cartesian (or pseudo-Cartesian) is the same as Cartesian but it, unlike Cartesian, has a quadratic 
form that is not positive definite. So, the quadratic form of Cartesian is (ds)? = S~"_, (dx')? while the 
quadratic form of quasi-Cartesian is (ds)? = 3°", ¢;(dz’)? with some ¢; being +1 and some ¢; being 
—1. An example of quasi-Cartesian is the coordinate system (or frame) of the Minkowski spacetime. 
We note that because the difference between Cartesian and quasi-Cartesian is not of primary interest 
to physicists in most contexts (although it is of primary interest to mathematicians due to its math- 
ematical nature) we may use Cartesian to mean Cartesian or quasi-Cartesian (and may similarly use 
Euclidean to mean flat). 

2. Give examples of common spaces of different dimensionality and characteristics. 
Answer: 
Plane: 2D flat Riemannian. 
Cylinder: 2D flat Riemannian.|®! 
Sphere: 2D curved Riemannian. 
Space of Euclidean geometry: 3D flat Riemannian. 
Space of special relativity: 4D flat pseudo-Riemannian. 
Space of general relativity: 4D curved pseudo-Riemannian. 

3. Discuss the need for using local coordinate systems. Also, give examples of commonly used local 
coordinate systems. 
Answer: Local coordinate systems are used to have a closer look (or zoom in) into a particular region 
or point of space or to obtain simplified formulation by exploiting the local properties of the space 
which are simpler than its global properties (e.g. flatness of space at local level). 
A prominent example of local coordinate system that is commonly used in the proofs and arguments 
of differential geometry (and hence in general relativity) is the “geodesic coordinates” which can be 
defined as a local coordinate system constructed at and in the neighborhood of a given point P (which 
is called the pole of the geodesic coordinate system) of the space such that all the Christoffel symbols 
vanish at P. Another prominent example is the local inertial frame which is widely used in general 


[62] We note that cylinder is intrinsically flat. Therefore, it can be coordinated intrinsically by a Cartesian system since it 
is isometric to plane and hence the Cartesian system of the plane is mapped onto the intrinsic Cartesian system of the 
cylinder. 
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relativity and its applications (as we saw and will see throughout the book). 

Note: regarding the relation between geodesic coordinate system and local Cartesian (or pseudo- 
Cartesian) system, we note that according to the given definitions and conventions in the literature 
local Cartesian system is a geodesic coordinate system but the opposite is not true in general. Hence, 
geodesic coordinate system is more general than local Cartesian system. 


Exercises 


1. What is the relation between space, coordinate system and transformation? 

2. Characterize the spaces of special relativity and general relativity. 

3. What is the difference between Riemannian space and pseudo-Riemannian space? Link this to the 
spaces of the relativity theories. 

4. Should Riemannian space be curved? 

5. List the main characteristics of Riemannian space.|® 

6. What is the coordinate system of the spaces of special relativity and general relativity? 

7. In the special and general theories of relativity, what transformation from one coordinate system to 
another coordinate system means? 

8. The covariant basis vectors of a given space are transformed from an unprimed coordinate system O 
to a primed coordinate system O’ by the relation E} = oe" Bj. Justify this relation and do the same 
for contravariant basis vectors. 

9. Which space can be coordinated by rectilinear coordinate systems and which space cannot? 

10. Show that E,-E’ = E’-E, = 0; (where E’ and E; are the basis vectors and 6; is the Kronecker delta). 

11. Why should we invent (and hence use) different types of coordinate systems which leads for example 
to the complications of transformation and to potential confusion? 

12. Discuss the relation between the invariance principle and coordinate transformations and link this to 
the nature of the coordinate systems and physical laws. 
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Tensors are mathematical entities that have certain transformation rules. Tensors are required to be 
invariant under certain transformations across coordinate systems, although this property of invariance 
may not be general. For example, a tensor may be invariant (and hence it is a tensor in a technical sense) 
with respect to certain types of transformations (e.g. proper transformations or Lorentz transformations) 
but not with respect to other types of transformations (e.g. improper transformations or Galilean trans- 
formations). It can also be invariant only across certain types of coordinate system (e.g. affine) but not 
across other types of coordinate system (e.g. general curvilinear). 
Tensors are characterized by a number of attributes; the main ones are: 
e Indices which refer to the coordinates of the underlying coordinate system and its basis vectors.|®4] 


For example, in the tensor At the index i refers to the i*” coordinate (or i‘” covariant basis vector) while 


the index j refers to the j*” coordinate (or j'” contravariant basis vector). 

e Order which is represented by the total number of indices whether repetitive (known as dummy) or 
non-repetitive (known as free). For example, the order of the tensor A is zero, the order of the tensor B? 
is two and the order of the tensor Chika is five. 

e Rank which is represented by the number of non-repetitive (or free) indices. For example, the rank of 
the tensor A is zero, the rank of the tensor Bj is two and the rank of the tensor C#,,, is three. The tensors 
of rank-0 are commonly known as scalars while the tensors of rank-1 are commonly known as vectors. 

e Dimension (or dimensionality) which is represented by the number of dimensions of the space which 
the tensor belongs to. For example, in a 2D space (or surface) the dimension of the tensor A;; is 2 while 
in a 4D space (e.g. the spacetime of general relativity) the dimension of A;; is 4. 


[63] This question is about Riemannian as opposite to non-Riemannian and not as opposite to pseudo-Riemannian and hence 
it includes pseudo-Riemannian. 

[64] As we will see, rank-0 tensors (or scalars) have no indices. However, they are still characterized by indices because they 
have 0 indices. 
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e Variance type, i.e. covariant or contravariant or mixed. The variance type is determined by the 
position of the indices, i.e. being lower (like A;) or upper (like B’’) or both (like C%). So, a tensor whose 
all indices are lower is covariant (e.g. Ajjx) and a tensor whose all indices are upper is contravariant 
(e.g. A*) while a tensor that has both lower and upper indices is mixed (e.g. A‘,). It is obvious that 
rank-0 tensors (or scalars) have no variance type because they have no indices, while the rank of the mixed 
tensors should be 2 or higher so that it is possible to have some upper and some lower indices. In fact, the 
use of upper and lower indices is just a notational convention that refers to a more fundamental issue that 
is upper indices are associated with covariant (or tangent) basis vectors while lower indices are associated 
with contravariant (or gradient) basis vectors. Accordingly, the tensor A; means A;E*, the tensor BY 
means BYE,E; and the tensor C’,, means C%,, E;E/ E*. It is common to indicate the variance type of 
a tensor by (mu, ) where n, is the number of upper (or contravariant) indices and uw is the number of 
lower (or covariant) indices. For example, the type of Aj; is (0,2), the type of A’* is (3,0) and the type 
of Ai! is (2,1). 

e Symmetry properties, i.e. being symmetric or anti-symmetric or neither. A symmetric tensor does 
not change if two of its indices are exchanged while an anti-symmetric tensor reverses its sign following 
such an exchange. For example, if Aj; is a symmetric tensor then its components satisfy the relation 
Ajj = Aji while if it is an anti-symmetric tensor then its components satisfy the relation Aj; = —Aji. 
A tensor that does not meet any one of these criteria is neither symmetric nor anti-symmetric. The 
symmetry properties are restricted to tensors of rank-2 and higher and hence scalars and vectors have 
no symmetry properties because they do not have sufficient number of indices to meet the criteria of 
symmetry properties. A tensor whose rank is greater than 2 can be totally symmetric or anti-symmetric 
and it can be partially symmetric or anti-symmetric and hence the symmetry properties apply with respect 
to some (but not all) of its indices. Such a tensor may also be symmetric with respect to some indices and 
anti-symmetric with respect to other indices. For example, if the components of the tensor Ajj, satisfy 
the relation Ajj, = Aji, but not the relation Ajj, = Aix; then it is partially symmetric (i.e. with respect 
to its ij indices). Similarly, if the components of the tensor A;;, satisfy the relation Ajj, = —Aji, but 
not the relation Ajj, = —Ajx; then it is partially anti-symmetric (i.e. with respect to its ij indices). 
Some tensors may have block symmetry properties when the symmetry properties apply not to individual 
indices but to sets (or blocks) of indices. For example, a tensor whose components satisfy the relation 
Aijkt = Axuj has block symmetry with respect to its first two indices and last two indices because the 
exchange of the position of these blocks does not change the tensor. It should be remarked that the 
symmetry properties of tensors are invariant under coordinate transformations and hence a tensor that is 
symmetric /anti-symmetric/neither in a given coordinate system remains so following its transformation 
to other coordinate systems. 

e Number of components (or elements or entries) which is given by the simple formula n” where n is 
the dimension and r is the rank. Accordingly, a scalar has only one component in any space because its 
rank is 0 and hence n° = 1, while a vector has n components (e.g. 2 in 2D space, 3 in 3D space, etc.) 
because its rank is 1 and hence n! = n. Similarly, a rank-2 tensor has n? components (e.g. 2? = 4 in 2D 
space, 3? = 9 in 3D space, etc.). 

e Number of independent (or distinct) components which is the number of non-correlated com- 
ponents. Since the components of symmetric and anti-symmetric tensors are correlated (i.e. by relations 
like Aj; = Aj; and A;; = —A,;), the number of independent components is less than the number of 
components. It can be easily shown (see exercise 2) that the number of independent components of 


rank-2 symmetric tensors is a while the number of independent non-zero components of rank-2 


anti-symmetric tensors is n(n) where n is the dimension. 

e Transformation rules which are the mathematical formulae used to transform tensors from one 

coordinate system to another coordinate system. An arbitrary tensor of type (m,n) which is symbolized 
6M 


with Ae in an unprimed coordinate system O and symbolized with All ‘y in a primed coordinate 
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system O’ is transformed from O to O’ according to the following rule:|®! 
Ox = Aa! Agi Ox” |, 
1I---M __ teem 
eae dxt — Oa™ Oa! Oa!N Ayn ay) 


This rule can be obtained as a generalization of the transformation rules of low-rank tensors with the help 
of the transformation rules of the basis vectors (see exercise 3). It should be noted that the type (m,n) 
includes the case m = 0 (i.e. covariant tensor like A;;) and the case n = 0 (i.e. contravariant tensor like 
A*!). Tn fact, it includes even the case m = n = 0 (i.e. scalar) where the transformation rule becomes 
A =A. 
e Algebraic rules which determine the tensorial nature of the entities obtained by simple algebraic 
operations on tensors. These rules include:!® 

1. Linear combinations of tensors (such as aC+bD where a and 6 are scalars and C and D are tensors) 

are tensors. 

2. Outer and inner products of tensors (e.g. CD and C-D where C and D are tensors) are tensors. 

3. Contractions of tensors (e.g. Ci; where Ci, is a tensor) are tensors. 

4, Permutations of tensors (e.g. Cj; which is a permutation of a tensor Ci,;) are tensors. 
e Differential rules which determine the tensorial and operational nature of the entities obtained by 
differential operations (mainly partial and total derivatives and covariant and absolute derivatives) on 
tensors. Some of these rules are discussed in § 2.7. 
e Metrical rules which determine the tensorial nature of the entities obtained by metrical operations on 
tensors. The main of these rules is that entities obtained by raising and lowering the indices of a tensor 
are tensors, i.e. all variance types of a tensor are tensors (see § 2.5). For example, if Cj; is a tensor then 
C%, and CY are also tensors. 


Problems 


1. Distinguish between general tensors and special (or qualified) tensors. 
Answer: General tensors transform invariantly (according to the tensor transformation rules) under 
all permissible (i.e. differentiable invertible) coordinate transformations, while special (or qualified) 
tensors transform invariantly under special (or qualified) type of transformations or across certain 
types of coordinate systems. For example, the tensors in general relativity are supposed to be general 
tensors, while the tensors in special relativity are tensors under Lorentz transformations and hence 
they are not required to be tensors under general transformations. For this reason the tensors of special 
relativity may be labeled as Lorentzian (or Lorentz) tensors. We can also find in the literature tensors 
that are labeled as Cartesian tensors or affine tensors because they transform as tensors only across 
Cartesian or affine coordinate systems (e.g. the tensors of classical physics and the Christoffel symbols 
which will be investigated in § 2.6). 
Note: the tensors of special relativity are more commonly known as “4-tensors” (although “4-tensors” 
is also used to label the tensors of general relativity which are supposed to be general tensors). So, 
“4-tensors” are special tensors in special relativity and are general tensors in general relativity.|°7 
We should also note that the tensors that transform invariantly across a certain type of coordinate 
systems should also be characterized by a certain type of transformations (which characterize that 
type of coordinate systems) although they are labeled by the type of coordinate systems. 

2. What is the relation between the indices of a tensor and its dimension? 
Answer: Each index of a tensor ranges over its dimension. For example, A? in nD space means 
i=1,---,n andj =1,--- ,n. This should explain why the number of components of tensor are given 
by n” where n is the dimension and r is the rank which represents the number of free indices. 


The use of uppercase letters to index primed symbols in this equation and its alike is for the purpose of more clarity 
although this is not necessary because the indices in the primed and unprimed symbols are totally independent of each 
other, e.g. i in x’ is independent of i in x* and hence they are totally different like i and k. 

In curved spaces, the legitimacy (and sensibility) of the operations that involve more than one tensor may depend on the 
co-locality of the tensors involved in these operations (i.e. the involved tensors correspond to the same point in space). 
For example, adding a tensor at one location to another tensor at another location may not be legitimate and hence it 
is meaningless to determine the tensorial nature of such illegitimate operation. 

As indicated earlier in § 1.3, the “4-” label in general relativity may also refer to the local Lorentzian origin of the quantity. 
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3. Show that the algebraic sum of symmetric tensors is a symmetric tensor. 
Answer: If A” and B” are symmetric tensors then we have A’ = A?’ and BY = BY! and their 
algebraic sum C’ will be C’ = A’? + BY. On shifting the indices of the last equation we get: 


CH = AV + BY = AV + BY = CV 


and hence C’) is also symmetric. We note that equality 2 is justified by the symmetry of A” and 
B*s while equality 3 is justified by the definition of C’’. The above applies similarly if the tensors are 
covariant. 

Note: although we used rank-2 contravariant tensors in the above demonstration, the result can be 
easily generalized to tensors of any rank and covariant type. We should also note that the above 
demonstration can also be easily applied to anti-symmetry, that is if A’? and BY are anti-symmetric 
tensors and C” is their algebraic sum then we have: 


CH = Ad + Bo = (—A%) + (—BY) = — (AY + BY) = C8 


In fact, we can even generalize “algebraic sum” to “linear combination”. 

4. What is the quotient rule of tensors? 

Answer: The quotient rule of tensors states: if the inner product of a suspected tensor by a known 
tensor is a tensor then the suspect is a tensor. For example, if A is a suspected tensor and B and C 
are known tensors and we have A-B = C (where the dot represents inner product) then A is a tensor. 

5. What is the important property of the zero tensor? What is the implication of this? 

Answer: It is the value-invariance (or constancy) of its components across all coordinate systems (i.e. 
its components are zero in any coordinate system). In other words, if a tensor vanishes (i.e. it is a 
zero tensor) in one coordinate system then it vanishes in any other coordinate system, |68 

The implication of this is that a tensorial relation (or equation) that is valid in one coordinate system 
should hold in all other coordinate systems due to its tensorial (invariant) nature since it can be 
expressed as a tensorial relation of the zero tensor. For example, if we have the tensorial relation 
A =B ina given coordinate system then we can put it in the form A —B = 0 and hence it should be 
valid across all coordinate systems due to the value-invariance of the zero tensor. In fact, the invariance 
of tensorial relations can also be easily deduced from the transformation rules of tensors (as will be 
seen next). 

6. Show that if two tensors of the same type (i.e. same rank, variance type, etc.) are equal (i.e. have 
identical corresponding components) in one coordinate system then they should also be equal in any 
other coordinate system. 

Answer: This may be “shown” easily by noting that the supposedly “two tensors” are actually identical 
(because they are equal) and hence they represent the same tensor.!®9l_ Therefore, the statement is 
self-evident (or even trivial) without need of a technical proof. However, in the following we show this 
formally (for the sake of clarity and rigor). 

Let At” and Bee be two such tensors in coordinate system O (unprimed) and hence we have 


ieee 
Aa = Bey On transforming this relation to another coordinate system O’ (primed) by multiplying 
the two sides by the Jacobian factors Oa" ‘ oo au" o a vee fis (see Eq. 41) we get: 

Ox Aa! Agi Ox" Aiem Ox = Aa!™ Aa OM Bie 

aati Oa™ AalT  AaIN TI” ~ Agi gm AglT OgiN Bin 


[68] This may be expressed by saying: a non-zero tensor cannot be transformed to a zero tensor and vice versa, (i.e. a non-zero 
tensor cannot be transformed away and a zero tensor cannot be transformed to a non-zero tensor). We should also note 
that the value-invariance property of the zero tensor also applies to its various variance types, i.e. if one variance type 
of a tensor is zero then all its variance types are also zero (e.g. if Ajj = 0 then At, = g* Ag; = 0). 

[69] Th fact, this may be questioned because the equality may not imply identicality (especially when considering the difference 
between mathematical and physical perspectives). For example, two objects may be mathematically equal but they are 
physically (or even mathematically) different, e.g. they represent realistically-different physical entities or conceptually- 
different mathematical entities. Anyway, this is a trivial matter and should not concern us. 
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As we see, each side of the last equation represents the transformation of a tensor from system O to 
system O’ and hence we should have: 

Aloe M — pll-M 

JN J--N 

i.e. the two tensors are also equal in system O’. Now, since O’ is an arbitrary system then the two 
tensors should be equal in all systems. 
Note: the presentation of the above proof may be simplified by the use of the zero tensor as follows: 
Since A and Bee are equal (i.e. Ar = Bie), then Ave = Be = 0 and hence this zero 
tensor can be transformed invariantly to any other system and accordingly At _ Be = 0 which 
leads to Aen = WB gee as required. However, this requires the applicability of the transformation 
to the individual terms and hence it is the same as the above demonstration. 


7. How are index shifting and index replacement operations conducted? 
Answer: Index shifting operations are conducted by contracting an index of the metric tensor with 
an index of another tensor (i.e. by inner product). So, in lowering a contravariant index the covariant 
metric tensor is used (e.g. A; = gi;A’) while in raising a covariant index the contravariant metric 
tensor is used (e.g A’ = g’ A;). 
Index replacement operations are conducted by contracting an index of the Kronecker delta tensor 
with an index of another tensor, e.g. A’ = 0;A’ and A; = 6;Aj; where the index j in tensor A is 
replaced by the index 7. 
Note: index replacement operation is conducted by using the Kronecker delta tensor in its mixed form 
(i.e. 67) which is the same as the mixed metric tensor g} (see § 2.5). Accordingly, we can say that all 
the above operations (i.e. index shifting and index replacement) are conducted by using the metric 
tensor, i.e. the covariant metric tensor for lowering index, the contravariant metric tensor for raising 
index and the mixed metric tensor for replacing index. 
Exercises 
1. Justify the fact that the number of components of a rank-r tensor in an nD space is given by n”. 
2. Justify the fact that the number of independent components of rank-2 symmetric tensors is n(n) 
while the number of independent non-zero!”®! components of rank-2 anti-symmetric tensors is ae 
3. Find the transformation rules of a vector A between unprimed and primed coordinate systems and 
hence generalize these rules to tensors of any rank. 
4. Distinguish between covariant and contravariant tensors using their transformation rules. 
5. Show that the zero tensor is value-invariant (i.e. constant) across all coordinate systems. 
6. What you notice about the tensor relation Agm = 0? 
7. Show that linear combinations of tensors are tensors. 
8. Show that outer products of tensors are tensors. 
9. Show that contractions of tensors (i.e. mixed tensors of rank > 1) are tensors. 
10. Show that inner products of tensors are tensors. 
11. Show that permutations of tensors are tensors. 
12. Show that the symmetry properties (i.e. being symmetric/anti-symmetric/neither) of tensors are 
invariant under coordinate transformations. 
13. Why in the contraction operation of tensors the contracted indices should be of opposite variance type 
(i.e. one upper and one lower)? 
14. Show that all the variance types of a tensor are tensors. 
15. Discuss the nabla and Laplacian operators in 4D spacetime. 
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The curvature of space can be an intrinsic property and hence it can be detected and measured by an 
inhabitant of the space. An example of this is the Gaussian curvature of a surface (such as a sphere) 


[70] 


“Non-zero” here means they are not necessarily zero due to the anti-symmetry requirement although they could be zero 


accidentally or for another reason. 
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which is an intrinsic property that can be detected and measured by a 2D inhabitant of the surface. 
The curvature of space can also be an extrinsic property and hence it is detected and measured by an 
inhabitant of the embedding space. For example, cylinder has no intrinsic curvature and hence a 2D 
inhabitant of the cylinder will see it flat like a plane with no curvature. However, from the 3D embedding 
space that contains this cylinder it is a curved surface, unlike plane, and hence a 3D inhabitant can detect 
and measure its curvature. 

We note the following about the curvature of space: 
e A space can be flat intrinsically and extrinsically (e.g. plane), or flat intrinsically and curved extrinsically 
(e.g. cylinder), or curved intrinsically and extrinsically (e.g. sphere). In fact, these are the common and 
familiar cases of space curvature. So, we may also find in the literature examples of spaces that are curved 
intrinsically but not extrinsically (although some of these examples are questionable). Also, there are 
examples of mathematical spaces that cannot be embedded in an embedding space and hence there is no 
meaning for them to have (or not have) extrinsic curvature although it is still meaningful to characterize 
them as curved spaces (i.e. intrinsically). 
e The curvature of interest in special relativity and in general relativity is the intrinsic curvature and not 
the extrinsic curvature.!™! In fact, we lack the ability to envisage the 4D spacetime of special relativity 
and general relativity let alone embedding this spacetime in a space of higher dimensionality. 
e The intrinsic curvature is mainly quantified by the Riemann-Christoffel curvature tensor (see § 2.10) 
although it may also be quantified by other parameters that are related to this tensor (e.g. the Gaussian 
curvature in the case of 2D spaces). Therefore, in general relativity the Riemann-Christoffel curvature 
tensor is the main building block for the formulation of spacetime curvature (see § 2.13). In fact, we will see 
that the Einstein tensor which defines the curvature of spacetime in the Field Equation of general relativity 
can be defined by the Christoffel symbols directly without resort to the Riemann-Christoffel curvature 
tensor. However, the Ricci curvature tensor and scalar (which enter in the definition of the Einstein tensor) 
are originally defined by the Riemann-Christoffel curvature tensor and hence the connection between the 
curvature of spacetime in general relativity and the Riemann-Christoffel curvature tensor is established. 
e A curved space may have constant curvature all over the space (e.g. sphere), or have variable curvature 
and hence the curvature is position dependent (e.g. torus). 
e There are several indicators and conditions for testing the intrinsic curvature and flatness of space. Some 
of these indicators and conditions will be briefly discussed in the questions and some will be investigated 
in detail later. 


Problems 


1. Briefly discuss the commonly used indicators and conditions for testing the intrinsic curvature and 
flatness of space. 
Answer: These include: 
e Special geometric tests such as the sum of the angles 6, of triangles in the space and the relation 
between the circumference C and the diameter D of circles in the space. So, in curved space 6, 4 7 and 
C # xD while in flat space 0, = 7 and C = 7D. In fact, these conditions reflect the local curvature 
and flatness (i.e. where the triangle and circle in our examples are located). The space is then qualified 
as curved if it is curved somewhere and as flat if it is flat everywhere. We should also remark that 
although the conditions in flat space (ie. 6, = a and C = 7D) are obviously valid everywhere, the 
validity of the conditions in curved space (i.e. 0, #4 7 and C 4 7D) at every locally curved location 
are not obvious and hence they need to be established rigorously (e.g. in the case where the location 
is negatively curved in part and positively curved in other part). 
e Coordination by Cartesian system (see § 2.1). So, curved space cannot be coordinated by Cartesian 
system while flat space can be coordinated by Cartesian system. It is noteworthy that Cartesian in 
this context includes quasi-Cartesian (and may even be generalized to rectilinear). We should also 
note that coordination here means globally since curved space can be coordinated by Cartesian system 
locally but not globally. 
e Riemann-Christoffel curvature tensor (see § 2.10). So, this tensor does not vanish identically in 


[73] So, flatness in special relativity means intrinsic flatness, i.e. vanishing of intrinsic curvature. 
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curved space, while it does vanish identically in flat space. In some cases this test also applies to other 
parameters of curvature, such as the Gaussian curvature of 2D spaces, that are based on the Riemann- 
Christoffel curvature tensor. It should be remarked that the Gaussian curvature in differential geometry 
is defined for 2D spaces although the concept may be extended to manifolds of higher dimensionality in 
the form of Riemannian curvature (see B2). We should also note that the Riemann-Christoffel curvature 
tensor in 2D space has only one independent component and hence the vanishing of Gaussian curvature 
is equivalent to the vanishing of this tensor. 

e Parallel transport (see § 2.8). So, in flat space a parallel-transported vector around a closed curve 
returns to its initial position in the same direction while in curved space it does not necessarily return 
so. 

e Metric tensor (see § 2.5). So, if all the components of the metric tensor are constants then the 
space is flat but the opposite is not true, i.e. the metric tensor of flat space may not be constant (e.g. 
Euclidean space coordinated by spherical coordinate system). In brief, the constancy of the metric 
tensor is a sufficient but not necessary condition for the space to be flat (and hence the value of this 
test is limited) .!7! 

e Christoffel symbols (see § 2.6). So, if all the Christoffel symbols of a given coordinate system of a 
space vanish identically then the space is flat; however the opposite is not true, ie. the Christoffel 
symbols may not vanish in a flat space if the coordinate system is curvilinear (e.g. Euclidean space 
coordinated by spherical coordinate system). In brief, the vanishing of Christoffel symbols is a sufficient 
but not necessary condition for the space to be flat (and hence the value of this test is limited). In 
fact, this test is based on the previous test since the Christoffel symbols are synthesized from the 
components of the metric tensor and their derivatives. 

We should remark that a space is flat only if it is globally flat (i.e. it is flat everywhere) while it is curved 
even if it is partially curved (i.e. it is curved somewhere). Hence, all these indicators and conditions 
should comply with this provision (or rather convention) when they are used to verify the flatness 
or curvature of space. For example, a space whose Riemann-Christoffel curvature tensor vanishes at 
certain location (but not globally) is not flat. In fact, adopting this convention should depend on the 
purpose and hence we may adopt a convention that is based on the local state of the space (i.e. curved 
and flat have only local significance) if it suits us. We may also use “curved” and “flat” only in global 
sense and hence we may have “partially flat and partially curved spaces”. 


Exercises 

1. Justify the indicators and conditions (for testing the intrinsic curvature and flatness of space) that we 
stated in the Problems. 

2. Give an example of an indicator for testing the intrinsic curvature and flatness of space other than 
those given in the Problems. 

3. Give an example of a special geometric test (other than those given in the Problems) for testing the 
intrinsic curvature and flatness of space. 

4. How do you define triangle and circle in curved spaces? 

5. In special and general relativity theories, what we mean by the curvature of space and why? 

6. Give simple examples of how a 2D inhabitant of a surface can detect the intrinsic curvature of his 2D 
space by conducting simple measurements. 

7. Briefly discuss the coordination by a Cartesian system as a sign for the curvedness and flatness of 
space. 

8. How can the Riemann-Christoffel curvature tensor be used for testing the curvedness and flatness of 
space? 

9. Flat spaces may be metricized by non-constant metric tensor (i.e. tensor whose components are func- 


tions of coordinates). For example, 3D Euclidean space can be coordinated by a spherical coordinate 
system whose metric gj; is diag [1,r?,r? sin” 6]. Comment on this. 


[72] The constancy of the metric tensor means the constancy of its individual elements although these elements are generally 


different. 
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2.4 Variational Principle and Euler-Lagrange Equation 


In many mathematical and physical situations we need to find the extremum (or optimum) points of a 
given function!”*! where the function reaches its maximum or minimum values (possibly locally), and this 
is achieved by using the variational principle. The essence of the variational principle is that the variation 
of the function at its extremum points vanishes. This principle is intuitive because at an extremum 
point the function should change its trend (i.e. from increasing to decreasing or from decreasing to 
increasing) and hence at the extremum point the function should cease to vary abruptly. In other words, 
a positive/negative variation trend followed by a negative/positive variation trend should be separated by 
a zero variation “trend”. 

The Euler-Lagrange equation is a relation that implements the variational principle in a specific math- 
ematical form when the function that should be optimized is a functional. More specifically, the Euler- 
Lagrange equation is a mathematical relation whose objective is to minimize or maximize a certain func- 
tional F(f) which depends on a function f. It is represented mathematically by a differential equation 
whose solutions optimize the particular functional F. The Euler-Lagrange equation in its generic, simple 


and most common form is given by: 
) d (0 
OF te (54) =( (42) 


OYx 


where f(x,y, Yx) is a function of the given variables that optimizes the functional F’, y is a function of a, 
and yz is the derivative of y with respect to x. 


Exercises 


1. What is the relation between the variational principle and the Euler-Lagrange equation? 
2. What is the role of the Euler-Lagrange equation in implementing the variational principle? 


2.5 Metric Tensor 


The metric tensor is a rank-2 symmetric differentiable invertible tensor that reflects and represents the 
geometric properties of the space. The main objective of the metric is to generalize the concept of length 
to general coordinate systems and hence maintain the invariance of length in different coordinate systems 
(see next paragraph). The metric tensor is also used to raise and lower indices and thus facilitate the 
transition between the covariant and contravariant types. As a tensor, the metric has geometric significance 
regardless of any coordinate system although it requires a coordinate system to be represented in a specific 
form. So in brief, the coordinate system and the space metric are independent but correlated entities. 
More specifically, the metric uniquely determines the space and identifies its geometric properties, but the 
opposite is untrue (i.e. the geometric properties of the space do not uniquely identify its metric) because 
the metric also depends on the choice of coordinate system, and the coordinate system for a given space 
is not unique. 

In an orthonormal Cartesian coordinate system of an nD space the length of infinitesimal element of arc, 
ds, connecting two neighboring points in the space, one with coordinates x; and the other with coordinates 
xu, +dx; (t=1,--- ,n), is given by: 


where 6;; is the Kronecker delta. This definition of length is the key to the introduction of a rank-2 tensor, 
gij, called the metric tensor which for a general coordinate system is defined by: 


(ds)* = gijsda' dx! (44) 


where the indexed x represent general coordinates. Accordingly, we may consider the metric tensor as 
a generalization of the Kronecker delta tensor that is associated with orthonormal Cartesian coordinate 


[73] We note that “function” in this context is more general than “simple function” or “function of function” where the latter 
is usually known as functional. So, f(a) is a simple function while F(f(x)) is a functional. 
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systems (see Problems). The metric tensor in the last equation is of covariant form, however it has also 
a contravariant form (which is notated with g‘’) and a mixed form (which is notated with g?). 

The components of the metric tensor in its covariant, contravariant and mixed forms are closely related 
to the basis vectors of the coordinate system, that is: 
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where the indexed E are the covariant and contravariant basis vectors and where in the second equalities 
we used the definition of these basis vectors as given by Eq. 38 (noting that x’ and 2) represent general 
coordinates while x; represents Cartesian coordinates whose variance type is irrelevant). As we see, the 
mixed metric tensor gi is the same as the identity tensor 6% (i.e. Kronecker delta). 

The metric tensor is necessarily symmetric with non-zero diagonal elements but not necessarily diago- 
nal.!"41 Moreover, for the subject of this book (i.e. general relativity as well as special relativity to some 
extent) the diagonal elements of a diagonal metric tensor are not necessarily positive.!7°] The last two 
statements obviously apply to the covariant and contravariant metric tensor but not to the mixed metric 
tensor which is the identity tensor and hence it is symmetric, diagonal with positive diagonal elements. 

The covariant and contravariant forms of the metric tensor are inverses of each other and hence we have 
the following relations: 


lal = [9"] [97] = [oad (48) 
where [---] symbolizes matrix while [---]~+ symbolizes its inverse. Therefore: 
[9°*] 9s] = [aie] [9""] = [55] (49) 


Since the metric tensor is invertible, its determinant should not vanish at any point in the space. Also, 
being invertible means that the metric tensor has always covariant and contravariant forms. 

Apart from its conceptual significance (as representative of the space geometry) and apart from its 
practical and theoretical use in computing the line element (i.e. ds), the metric tensor is commonly used 
as an operator for lowering and raising indices (i.e. covariant metric tensor lowers indices and contravariant 
metric tensor raises indices) and hence facilitating the transition between the covariant and contravariant 
types. Consequently, in Riemannian space tensors can be cast into covariant or contravariant or mixed 
form. This means that the different forms of a tensor are equivalent and hence they represent the same 
tensor but in reference to different basis vector sets (i.e. covariant refers to contravariant basis and 
contravariant refers to covariant basis as seen in § 2.2). 

As indicated earlier, the metric tensor is a variable function of coordinates in general. However, in 
rectilinear coordinate systems the metric tensor is constant because the basis vectors in these systems 
are constants. As we will see, a sufficient and necessary condition for the metric tensor to be constant 
in a given coordinate system is that the Christoffel symbols of the first or second kind vanish identically 

The metric tensor in general (i.e. whether variable or constant) behaves like 
a constant with respect to tensor differentiation and hence the covariant and intrinsic derivatives of the 
metric tensor vanish identically (see § 2.7). 


Problems 


1. Compare the concept of “transformation” with the concept of “shifting variance type”. 
Answer: We note the following: 


[74] The necessary and sufficient condition for the metric tensor to be diagonal is that the coordinate system is orthogonal. 
[75] As indicated earlier (see § 1.3 and § 2.1), being necessarily positive (i.e. having positive definite quadratic form) or not 
depends on the space being Riemannian or pseudo-Riemannian. 
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e Transformation is an operation for shifting from one coordinate system to another coordinate system, 
while shifting variance type (by using the metric tensor as an index shifting operator) is an operation 
for shifting from one type of basis vectors (i.e. covariant or contravariant) of a given coordinate system 
to another type of basis vectors of that coordinate system. So in brief, transformation means changing 
coordinate system while shifting variance type means changing basis vectors of the same coordinate 
system. Hence, in shifting variance type both the covariant and contravariant types belong to the 
same coordinate system. However, we should notice the significance of the index shifting operation of 
covariant and contravariant metric tensor (see the next two problems). 

e From a procedural perspective, in transformation we use a set of equations (e.g. Cartesian-to- 
spherical transformations or Lorentz transformations) ,|76 while in shifting variance type we use the 
metric tensor. 

e The subject of transformation is coordinate system, while the subject of shifting variance type is 
tensor (or tensor-like entity). 

2. Show that the metric tensor can be regarded as a transformation of the Kronecker delta tensor in 
its different variance types from an orthonormal Cartesian coordinate system to a general coordinate 
system. 

Answer: From the definition of the metric tensor in its different variance types, we have: 
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where x’ and x/ represent general coordinates while x* represents Cartesian coordinates (whose vari- 
ance type is irrelevant). These equations can be written as (noting that the indices k and / refer to 
Cartesian coordinates): 
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where they are justified by the fact that the Kronecker delta tensor is an index replacement operator 
plus the fact that the corresponding components of all the variance types of the Kronecker delta are 
the same, ie. 6;; = 6 = 6%. As seen, the relations in Eq. 50 are the transformation equations of 
the Kronecker delta tensor in its different variance types from an orthonormal Cartesian coordinate 
system to a general coordinate system, as required. In other words, the relations in Eq. 50 are the 
transformation rules of the metric tensor in orthonormal Cartesian coordinate system (which is the 
Kronecker delta tensor 5,1,6*', df) to the metric tensor in general coordinate system (i.e. gij, 9", 94). 
We note that in curved space this transformation applies only locally. 

3. What is the significance of raising an index of the covariant metric tensor gj, and lowering an index 
of the contravariant metric tensor g/*? 

Answer: Using g’ as an index raising operator, we have: 


9 95k = Ie = 9% 
Similarly, using g;; as an index lowering operator, we have: 


ou” = 9° = jj 
Hence, these operations mean taking the metric tensor back to its original Cartesian form (i.e. Kro- 
necker delta) which the tensor originates from (according to the previous problem). In other words, 
we are reversing the transformation of the previous problem. This should be seen as a link between 
changing variance type (although restricted here to the metric tensor) and transforming between coor- 
dinate systems (although restricted here to Cartesian and locally), and hence it is related to the first 
problem. 


[76] Ty fact, in transformation we use the Jacobian matrix which is derived from the transformation equations (see B3 and 
B3X). 
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In fact, this should be more appreciated if we note that the covariant and contravariant forms of the 
metric tensor are inverses of each other and hence if one of these forms corresponds to a transforma- 
tion (of the Kronecker delta) from an orthonormal Cartesian coordinate system to a general coordinate 
system then the other form (when applied to the first form) should correspond to an opposite transfor- 
mation from a general coordinate system to an orthonormal Cartesian coordinate system (i.e. taking 
the metric back to its origin which is the Kronecker delta). Also, see question 4.52 of B3X. 


Exercises 


Explain the relation between space, coordinate system and metric tensor. 

What are the physical dimensions of the metric tensor? 

Show that Gig = E; 2 E;. 

Show that the metric is a rank-2 tensor. 

Show that the Kronecker delta is a rank-2 tensor. 

Why is the metric tensor symmetric? 

Summarize the main properties of the line element ds in Riemannian space highlighting its relation to 

the metric tensor. 

8. Show the invariance of the quadratic form (ds)” across coordinate systems. 
9. Give the metric tensor of the 3D Euclidean space for the following coordinate systems: orthonormal 

Cartesian, cylindrical and spherical. 

10. Give the metric tensor of the 4D flat spacetime of special relativity using the following 3D spatial 
coordinate systems: orthonormal Cartesian, cylindrical and spherical. 

11. Justify the use of the Kronecker delta as an index replacement operator. 

12. Show that the covariant and contravariant components of a vector v can be obtained from the relations: 
v-E; =v; and v- FE’ =v". 

13. Justify the use of the metric tensor as an index shifting operator. 

14. Show that in orthonormal Cartesian systems of Euclidean spaces there is no difference between the 
covariant and contravariant types of tensors. 

15. Show that the metric tensor is diagonal iff the coordinate system is orthogonal. 

16. Show et He constancy of the metric tensor is a sufficient but not necessary condition for the space 

to be flat.!77 
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2.5.1 Metric Tensor of 4D Spacetime of General Relativity 


Unlike the metric tensor of the flat spacetime of special relativity which has a general form (assuming a 

particular underlying 3D spatial coordinate system), the metric tensor of the curved spacetime of general 

relativity has no such general form because it depends on the distribution and flow of matter and energy 

as described by the energy-momentum tensor (see § 2.14 and § 3.2) 78] Accordingly, we should search 

for the metric tensor for each individual case and this depends on the physical setting and the adopted 

assumptions (as well as the employed coordinate system). We will see a few examples of the metric tensor 

of general relativity for some special and simple cases, e.g. Schwarzschild metric (see § 4.1). 

Exercises 

1. What we mean by “general form” when we say “the metric tensor of the flat spacetime of special 
relativity has a general form’? 

2. Compare the metrics of special relativity and general relativity. 

3. What is the implication of not having a unique and general form of the metric in general relativity? 


[77] As indicated earlier, the constancy of the metric tensor means the constancy of its individual elements. 
I78] Tn fact, this is a physical argument based on the validity of general relativity and its Field Equation. We can also have 
a purely geometric argument based on the fact that curved geometry, unlike flat geometry, is not unique. 
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2.6 Christoffel Symbols 


The Christoffel symbols are rank-3 affine tensors but they are not general tensors (see exercise 12). As a 
consequence of not being general tensors, if all the Christoffel symbols of either kind vanish in a particular 
coordinate system of a given space they will not necessarily vanish in other coordinate systems of that 
space. For instance, all the Christoffel symbols vanish in Cartesian coordinate systems of 3D Euclidean 
space but not in cylindrical or spherical coordinate systems of this space (see B3). As we will see, the 
Christoffel symbols solely depend on the metric tensor and hence they are intrinsic properties of their 
space. The Christoffel symbols are classified as those of the first kind and those of the second kind. These 
two kinds are linked through the index raising and lowering operators. Both kinds of the Christoffel 
symbols are variable functions of coordinates since they depend on the metric tensor which is coordinate 
dependent in general. 
The Christoffel symbols of the first kind are given by: 


1 
5 (O; gi + i951 — W9:3) (51) 


where the indexed g is the covariant metric tensor. The Christoffel symbols of the second kind are obtained 
by raising the third index of the Christoffel symbols of the first kind, and hence they are given by: 
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where g*! is the contravariant metric tensor. This process is reversible and hence the Christoffel symbols 
of the first kind can be obtained from the Christoffel symbols of the second kind by lowering the upper 
index, that is: 


(Oj gi + 0:95 — O19i3) (52) 


GkmV 5 = Gemg" [15,1] = Frm [15,0] = [i5, m] (53) 
The Christoffel symbols of the first and second kind are symmetric in their paired indices, that is: 
[ijk] = (gt, (54) 
ko _ pk 
Pe (55) 


These properties can be verified by shifting the indices in the mathematical expressions of the Christoffel 
symbols, as given by Eqs. 51 and 52, noting that the metric tensor is symmetric (see § 2.5).!79! 

In any coordinate system, all the Christoffel symbols of the first and second kind vanish identically ff all 
the components of the metric tensor in that system are constants (see exercise 7). Now, in affine coordinate 
systems all the components of the metric tensor are constants and hence all the Christoffel symbols of both 
kinds vanish identically. The prominent example is the orthonormal Cartesian coordinate systems where 
all the Christoffel symbols of the first and second kind are identically zero. In an nD space, the number of 
the Christoffel symbols of either kind is n? which represents all the possible permutations of the 3 indices 
including the repetitive ones. However, because of the symmetry property in their paired indices the 
number of independent Christoffel symbols of each kind in general coordinate systems is reduced to:|®°! 


n? (n+ 1) 
2 


N= (56) 


Problems 


1. What are the other names given to the Christoffel symbols of the second kind in the literature of 
general relativity? 
Answer: The Christoffel symbols of the second kind are known as affine connections or connection 
coefficients or metric connections, as well as other names. In fact, Christoffel symbols and those terms 
(such as connection coefficients) are not the same conceptually although for our purpose in this book 
their difference can be ignored and hence we treat them as identical. In fact, the Christoffel symbols 
should be considered a special type of connection coefficients. 


(791 This is demonstrated in question 5.10 of B3X. Also see Problem 4. 
[80] This is proved in question 5.28 of B3X. 
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2. Justify the fact that the Christoffel symbols are an intrinsic property of the space. 
Answer: This can be easily concluded from Eqs. 51 and 52 since the Christoffel symbols of both 
kinds can be expressed solely in terms of the metric tensor and its derivatives, and the metric tensor 
is obviously intrinsic. 

3. Develop the concept and fundamental definition of the Christoffel symbols. 
Answer: The partial derivatives of the basis vectors of a given coordinate system (whether rectilinear 
or curvilinear) of a given space (whether flat or curved) are vectors that belong to the space and hence 
they can be expressed as linear combination of the basis vectors of the space.!®!!_ To cast this fact into a 
rigorous and fully informative mathematical form we write the following equation: 0;E; = TEx which 
can be considered as the fundamental definition of the Christoffel symbol rk. As we see, rk symbolizes 
the component of the partial derivative of the basis vector E; with respect to the j*” coordinate a in 
the direction of the basis vector E;, and hence it is fully informative with no ambiguity although we 
still need to develop a specific mathematical expression for this symbol. 
Now, we should develop a similar fundamental definition for the Christoffel symbol in terms of the 
contravariant basis vectors, that is: 


E,-E’ = & 
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where line 1 is the reciprocity relation between covariant and contravariant basis vectors, line 3 is the 
product rule plus the fact that the components of the unity tensor are constants, in line 4 we use 
the above fundamental definition of the Christoffel symbols in terms of the covariant basis vectors 
(ie. 0;E; = TEE), line 5 is the reciprocity relation, in line 6 we use Kronecker delta as an index 
replacement operator, and line 8 is another form of line 7 (noting that line 7 can be obtained from 
line 8 by inner product of both sides with E, after changing the dummy index k).|8?]_ Hence, the 
fundamental definition of the Christoffel symbols is given by the following basic relations: 


0,;E; =T%.E, and 0;E’ = -T;,,B* 
The next thing we need to do is to obtain a completely “covariant” version of the above Christoffel 
symbols. This is done by lowering the upper index using the covariant metric tensor, that is: 


We note that in the above developments we are assuming that rk has certain tensor-like properties 
such as shifting indices by the metric tensor. This could be justified by the fact that these indices refer 
to covariant and contravariant basis vectors and hence they should be treated like tensor indices. We 


[81] We are considering a space of sufficient dimensionality from an intrinsic perspective with no regard to embedding in a 
space of higher dimensionality. Also, there are theoretical subtleties about how the vectors belong to the space and how 
they can be expressed as linear combination of the basis vectors (where the projection onto the tangent space at the 
point of application is considered in addressing these subtleties). However, we follow an intuitive approach and hence 
we do not go through these details. In brief, we can take the above premise (i.e. the partial derivatives of the basis 
vectors of a given space are vectors that belong to the space) as assumption in the development of our definition of the 
Christoffel symbols. 

[82] Tn fact, this in essence is the same as the relation v- E; = v; (or V- Ey = V,"Ex) which we proved in exercise 12 of § 
2.5. Also see exercise 10. 


2.6 Christoffel Symbols 60 


should also note that ry, is affine tensor which will become clear when we find specific mathematical 
expressions for the Christoffel symbols. 
Finally, we need to find specific mathematical expressions for the Christoffel symbols of both versions 
(or kinds), i.e. [¢7, k] and rk. This will be dealt with in the next problem. 

4. Using the fundamental definition of rk (i.e. O;E; = TEx) and the definition of [17,1] (ie. [47,U] = 
ImiV77), show that: 
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Answer: First, we should note that: 


TEx = 0; E; O;Oir O,0;r OE; = TEx 


and hence the Christoffel symbols are symmetric in their paired indices, i.e. ry, = rk. Moreover, 


since If, is symmetric in its paired indices then [ij,k] (which is obtained from If, by lowering the 
non-paired index) should also be symmetric in its paired indices (see the upcoming note). 
Now, we have: 


gi = E,-E, 
Ojgi = Oj; (BE; - Ey) 
Ojgu = (0;E;)-E, + E; - (0;Ei) 
Ogu = (UEx) +E, +E; - (UfEx) 
Aig = VE om + fi gir 
Ojga = [19,1] + [19,7] 


where line 1 is the relation between the metric coefficients and the basis vectors (see Eq. 45), line 3 is 
the product rule, line 4 is the fundamental definition of the Christoffel symbols, and line 6 is using the 
metric tensor as an index lowering operator (in conjunction with the definition of the symbols [ij, 1] 
and [lj,7]). On relabeling the indices in the last equation, we get two other relations: 


Og; = [94,0] + le, 3] 
Ogiy = [l,j] + [91,7] 


On adding these three relations algebraically we get: 
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[7,/] = 5 (0; 9: + O:gj1 — O19%3) 


which completes the proof of the first part of the problem. We note that in line 2 we use the afore- 
mentioned symmetry property of the Christoffel symbols in their paired indices. 
Now, if we multiply the last equation with g*! we get: 
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which completes the proof of the second part of the problem. 

Note: we should note that deviation from symmetry in the paired indices of the Christoffel symbols 
(or rather connection coefficients to be accurate) is considered in the literature (although from different 
perspective and starting point) where this deviation is quantified by the so-called torsion tensor which 
is the difference between rk and ry, (i.e. rk - r¥,). However, we do not go through these details. 
In brief, we assume that the torsion tensor is identically zero (i.e. the space is torsion-free) and the 
covariant derivative of the metric tensor is also identically zero, and hence we have “metric connections”. 

5. Show that in orthogonal coordinate systems, the Christoffel symbols of the first kind are given by: 


(77, k] =0 (GAG #K) 
1 
(24, k] = — 5 Ons (i # k, no sum on 7) 
1 
[23,7] = [72,4] = iH (i 4 Jj, no sum on #) 
1 
[i,t] = isi (no sum on 7) 


Answer: The Christoffel symbols of the first kind are given by: 


1 
ie 
[i7, k] 5 


(Ojgik + Oigjk — OnGiz) 


In orthogonal coordinate systems the metric tensor is diagonal (see exercise 15 of § 2.5) and hence 
Jab = 0 when a ¥ 6b. Accordingly: 

e When all the indices are different (i.e. 1 # 7 Ak) then giz = gjx = Gi; = 0 identically and hence 
ij, k] = 0. 

e When the paired indices are identical (i.e. i = 7 and i # k) then: 


fs, R] = > 


1 1 
9 (Oigik + O:Gir — Ongii) = 3 (0 +0 — Opgii) = 5 Gi (no sum on 7) 


e When two unpaired indices are identical (i.e. i = k and i 4 7) then: 


sche — 1 1 

[‘j, 7] = 5 (Oj 9% + 01951 — O:913) = 5 (0;9:; +0—0) = 5 O91 (no sum on 7) 
e When all the indices are identical (i.e. ¢ = 7 = k) then: 

seat al 1 

[i,t] = 3 (Oigis + O:Gii — Oi Gui) = isi (no sum on i) 


We note that the above 4 cases are inclusive to all the possibilities of the Christoffel symbol with 
regard to the identicality and difference of its indices (where in the case of two unpaired indices the 
symmetry in the paired indices should be considered in the above demonstration). 

6. Using the given expressions in the previous problem, derive the expressions of the Christoffel symbols 
of the second kind in orthogonal coordinate systems. 
Answer: The Christoffel symbols of the second kind are given by: 


Leg" (Gl 


In orthogonal coordinate systems we have gay = g*” = 0 (a  b), and hence the Christoffel symbols of 
the second kind are given by: 


De = g** lij, k] = (no sum on k) 


where g** = 1/gxz, is justified by the diagonality of the metric tensor plus the fact that the covariant 
and contravariant metric tensors are inverses of each other. Accordingly: 
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e When all the indices are different (i.e. i 4 7 4 k) then [i7,k] = 0 (according to the previous problem) 
and hence: 
[j,k] _ 0 


r*. 
4 Gkk Gkk 


e When the paired indices are identical (i.e. i = 7 and i # k) then: 


re = [é, Fi] = — 
oe) Big 29kk 


OnGii (no sum on i or k) 


e When two unpaired indices are identical (i.e. i = k and i 7) then: 


; _ [e,4] 1 
Y= = ae = Dg IS (no sum on #) 


e When all the indices are identical (i.e. 1 = 7 = k) then: 


(iz, 2] 1 : 
Tt. = = —— 0; 9ii no sum on 2 
" Gui 29% °°" ( ) 
Exercises 
1. The Christoffel symbols are not tensors although they have indices that refer to coordinates and basis 


12. 


vectors like tensors. Justify. 

Explain “affine tensors” within the context of Christoffel symbols. 

What are the number of Christoffel symbols and the number of independent Christoffel symbols of 
each kind in the spacetime of general relativity? 

What are the physical dimensions of the Christoffel symbols of the first and second kind? 

Prove the following relation: 0,9; = [ij, 1] + [j, ¢]. 

Show that the Christoffel symbols vanish identically iff the coordinate system is rectilinear (and 
Cartesian in particular). 

Prove that in any coordinate system, all the Christoffel symbols of either kind vanish identically iff all 
the components of the metric tensor in the given coordinate system are constants. 83] 

Show that the vanishing of the Christoffel symbols identically is a sufficient but not necessary condition 
for space flatness. 


. Summarize the main properties of the Christoffel symbols. 
. Show that E* Oj; E; = rk and Ex : 0;E' = mele 
. Show that: 


k = Ox* O07? La 

I Oxq Ox' Oxi 
where the lower-indexed x are Cartesian coordinates and the upper-indexed x are general coordinates. 
Show that the Christoffel symbols transform as: 


rk Oa” Og ORE 26 9 Om. eens 
Ko | —— 
9 Ax’ Axl Axi "Axe Axl"da'i 


Hence, conclude that the Christoffel symbols are affine tensors but not general tensors. 


2.7 Tensor Differentiation 


We mean by tensor differentiation covariant and intrinsic (or absolute) differentiation.!*4! In a sense, 
the former is the equivalent in curvilinear systems to the ordinary partial differentiation in rectilinear 


[83] “Tdentically” should indicate the validity over the entire space. However, the statement may also apply in a given region. 
[84] Tp fact, this is from a technical tensorial perspective (which is our main concern); otherwise we will investigate in the 


questions even ordinary types of differential operations on tensors (i.e. partial and total) and hence “tensor differentiation” 
is more general. 
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systems while the latter is the equivalent in curvilinear systems to the ordinary total differentiation in 
rectilinear systems. The objective of tensor differentiation is to ensure the invariance of derivative (i.e. 
being a tensor) in general coordinate systems, and this results in applying more sophisticated rules using 
Christoffel symbols. As we will see, using Christoffel symbols is based on differentiating the basis vectors 
due to the fact that the basis vectors in curvilinear systems are variable functions of coordinates and 
hence they should be differentiated like the components of the tensor (see § 2.6). The covariant derivative 
of a tensor is a tensor that is one covariant rank higher than the differentiated tensor while the intrinsic 
derivative of a tensor is a tensor that is the same rank and variance type as the differentiated tensor. 
However, we should note that the differentiation index in covariant derivative may be raised by the 
metric tensor and hence we get “contravariant differentiation” and therefore the resulting tensor is one 
contravariant rank higher than the differentiated tensor. 

In brief, the covariant derivative is a partial derivative of the tensor that includes differentiating the 
basis vectors as well as differentiating the components of the tensor. Accordingly, the covariant derivative 
of a general tensor A is given generically by: 


Aug = Op (AG BE, -+- Em E? ---E”) = At”, E;--- EE’ ---E” (57) 


where the semicolon is a conventional notation for the covariant derivative of the tensor with respect to 
the coordinate indexed by the following index (i.e. the k*” coordinate in this case). For example, the 
covariant derivative of a rank-2 mixed tensor, A = A,’E’E; is obtained as follows: 


Auk On (4/E'E;) 


aA;?) E'E; + Ae (0,.E") Ej + A;'E' (O.E;) 


E'E, + Aj (-It,E*) E; + 4/E' (T4,,Ea) 


( 
(A) 

= (a4, 2) BiB, — ATi, E*B, + A279, EE, 
(aA) 


E'E; — A21%,E'E; + ATs, E'E; 


I 


2 AgTs, + 4,°T),) E'B, 
Aj, B'B 


I 


where line 1 is a definition, line 2 is the product rule of differentiation, line 3 is based on the fundamental 
definition of Christoffel symbols (i.e. 0;E; = T'¥,E, and 0;E’ = —Ty,,E*), line 5 is relabeling of dummy 
indices, line 6 is taking a common factor, and line 7 is a conventional notation for the covariant derivative 
of the tensor with respect to the k*” coordinate «*. 

Following this pattern, we can see that the covariant derivative of a differentiable tensor A of type 
(m,n) with respect to the k“” coordinate is given by: 
biiathny i Lien im Atal 
AD = OpAge y m4 TA; sits saan eh eee (58) 

1 it ..-tm 
apa OS Se Ae 

The intrinsic derivative of a tensor along a t-parameterized curve x'(t) with respect to the parameter 
t is the inner product of the covariant derivative of the tensor and the tangent vector dz'/dt to the 
curve.!®5] In fact, the intrinsic derivative of a general tensor is a total derivative of the tensor that includes 
differentiating the basis vectors as well as differentiating the components of the tensor. Accordingly, the 


[85] For this to be strictly valid, the tensor should be defined over the region enclosing the curve and not only over the curve. 
However, we can always imagine (possibly hypothetically) that this is the case to justify this definition from a formal 
(or superficial) viewpoint. It is worth noting that in some old textbooks the intrinsic derivative is labeled as “covariant 
derivative along a curve”. 
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intrinsic derivative of a general tensor A is given generically by: 


d 


r , Ai ™®#, 2 -E,,E’ Ree DL k 
— (Ai BE, --- EB, EB! --- EB”) O(ARR ) dx 


(59) 


dt Oxk dt 
deem dark j n 
= jonk gp Bi: Em! ..E 
A 
= —" §£,.--E,,E’-.-E” 


where the chain rule is used in line 1 and Eq. 57 is used in line 2 while 5/6¢ in the last line is a conventional 
notation for the intrinsic derivative of the tensor with respect to the parameter t. For example, the intrinsic 
derivative of a contravariant vector A = A‘E; along a t-parameterized curve is obtained as follows: 


dA d 


a a (60) 
ee 
~ "dt dt 
- ” dt Oxi dt 
dA’ a dai 
Eg ae 
dA’ . dai 
= E,—+E,4*T,— 
dt KI dt 
dA’ . dx 
_ Atri. \E 
( dee a ) ‘ 
5A? 
ae 


where line 2 is the product rule of differentiation, line 3 is the chain rule of differentiation, line 4 is the 
fundamental definition of Christoffel symbols (i.e. 0;E; = TE Ex); line 5 is exchanging the dummy indices 
i and k, line 6 is taking a common factor, and line 7 is a conventional notation for the intrinsic derivative 
of the vector with respect to the parameter t. This pattern can be easily generalized to tensors of any rank 
and variance type. In fact, the pattern of intrinsic differentiation can be easily obtained from the pattern 
of covariant differentiation by noting that the intrinsic derivative is an inner product of the covariant 
derivative and the tangent vector (as seen in Eq. 59). 


Problems 


1: 


We started this section with the statement: “We mean by tensor differentiation covariant and intrinsic 
(or absolute) differentiation”. Comment on this. 

Answer: In fact, “tensor differentiation” in its general sense should also include ordinary differentiation 
(i.e. partial and total differentiation) of tensors but this has no particular relation to tensors, moreover 
it does not preserve the tensorial nature of tensors in general. Therefore, it is not of prime interest 
to us apart from determining its status within the calculus of tensors (as will be investigated in the 
questions). 

Make a list of the main properties and rules of tensor differentiation. 

Answer: We note the following:|*° 

e Tensor differentiation is the same as ordinary differentiation (i.e. partial and total differentiation) 
but with the application of the differentiation process on both the tensor components and its basis 
vectors using the product rule of differentiation. 

e The covariant and intrinsic derivatives of tensors are tensors. 


[86] Some of the following properties and rules are demonstrated and proved in B3 and B3X (which the interested readers 


are referred to for details). These properties and rules are also investigated further in the Exercises. 
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e The covariant derivative is 1 covariant rank higher than the rank of the differentiated tensor while 
the rank and variance type of the intrinsic derivative is the same as the rank and variance type of the 
differentiated tensor. Hence, a tensor of type (m,n) will have a covariant derivative of type (m,n+ 1) 
and an intrinsic derivative of type (m,n). 

e The sum and product rules of differentiation apply to tensor differentiation like ordinary differenti- 
ation. 

e The covariant and intrinsic derivatives of scalars and affine tensors!®7! of higher ranks are the same 
as the ordinary derivatives (i.e. partial and total derivatives). 

e The covariant and intrinsic derivatives of the metric, Kronecker and permutation tensors as well as 
the basis vectors vanish identically in any coordinate system. 

e Unlike ordinary differential operators, tensor differential operators do not commute with each other 
in general. 

e Tensor differential operators commute with the index replacement operator and index shifting oper- 
ators. 

e Tensor differential operators commute with the contraction of indices. 

e The covariant derivative is generally defined over a space or a region of space while the intrinsic 
derivative is defined over a curve. 

e The intrinsic derivative of a tensor may be seen as the inner product of the covariant derivative of 
the tensor and the tangent vector to the curve (by contracting the differentiation index of the covariant 
derivative with the index of the tangent vector). 


Exercises 


SOOO AV OOF 


10. 


11. 
12. 
13. 


14. 
15. 


16. 
17. 
18. 


19. 


. What is the essence of tensor differentiation? 


Show that the covariant and intrinsic derivatives of scalars and affine tensors of higher ranks are the 
same as the ordinary derivatives (i.e. partial and total derivatives). 

Show that the partial derivative of a scalar is a tensor (i.e. rank-1 tensor which is vector). 

Show that the total derivative of a scalar along a t-parameterized curve is a tensor (i.e. scalar or rank-0 
tensor). 

Demonstrate that the partial derivative of a non-scalar tensor is not a tensor (in general). 
Demonstrate that the total derivative of a non-scalar tensor is not a tensor. 

Show that the covariant derivative of a tensor is a tensor. 

Show that the absolute derivative of a tensor is a tensor. 

Derive, from first principles, the mathematical expression for the intrinsic derivative of a covariant 
vector A = A;E’ with full justification of each step. 

Using the expressions for the intrinsic derivative of the tensors A; and B? along a curve 2/ (t), show that 
the intrinsic derivative is an inner product of the covariant derivative of the tensors and the tangent 
vector dx /dt to the curve. 

Make a comparison between covariant derivative and absolute derivative. 

Show that the covariant derivative of the basis vectors vanishes identically. 

Show that the covariant and intrinsic derivatives of the metric tensor and the Kronecker delta tensor 
vanish identically. 

Elucidate the fact that tensor differential operators do not commute with each other in general. 
Show that tensor differential operators commute with the index shifting operators and the index 
replacement operator. 

Show that tensor differential operators commute with the contraction of indices. 

Show that the sum rule of differentiation applies to tensor differentiation like ordinary differentiation. 
Show that the product rule of differentiation applies to tensor differentiation like ordinary differentia- 
tion. 

What is the link between the following premises: “the intrinsic derivative is a total derivative of the 
tensor that includes differentiating the basis vectors” and “the intrinsic derivative is the inner product 
of the covariant derivative of the tensor and the tangent vector’? 


[87] Affine tensors here means tensors in affine coordinate systems. 
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20. The rank and variance type of the intrinsic derivative is the same as the rank and variance type of the 
differentiated tensor. Why? 


2.8 Parallel Transport 


We discuss in this section parallel transport!®*! which was used for characterizing the curvature of space 
(see § 2.3) and will be used for identifying geodesic paths (see § 2.9.4). The essence of the concept of 
“parallelism” is “having the same direction”. In flat space, parallelism is a global and absolute property 
as it is defined without reference to a particular position or path where the absolute concept of straight 
line with a fixed and global direction is used. This is because “direction” in flat space is independent of 
position. However, in curved Riemannian space we need a generalization or adaptation to the concept of 
parallelism to suit the curved nature of the space because in curved space we do not have fixed direction 
in the global and absolute sense that we have in flat space. In other words, “direction” in curved space 
is dependent on position. This adaptation and generalization is made through the concept of parallel 
transport whose essence is the characterization of parallelism by the local direction along a given curve 
in the space. So, parallel transport means moving (or transporting) a vector along a curve in a series of 
infinitesimal steps while keeping the vector constant during each step so that the vector in the end of the 
step is identical to the vector in the start of the step.|8°! In other words, keeping the vector infinitesimally 
parallel to itself during its journey along the curve. Hence, in Riemannian space the idea of parallelism 
is defined with respect to a local direction and with reference to a prescribed curve and therefore it is 
different from the idea of parallelism in the Euclidean sense. 

To be more clear let have a vector at a given position P; in an nD space (whether the space is flat or 
curved) and we want to transport this vector to another position P2 in the space such that the magnitude 
and direction of the vector at P, are the same as the magnitude and direction of the vector at P,. If the 
space is flat then we just move the vector along any path connecting P, to P, while keeping it parallel 
to its direction at P, because in flat space parallelism is global and absolute and hence it is independent 
of the position of the vector and the path of transportation. However, if the space is curved then it is 
not obvious how to do this because the direction in curved space intuitively depends on the position and 
hence on the path. So, we need to generalize the idea of parallel transport from flat space to curved space 
in a sensible way. To do this we need to find the mathematical condition that represents the process 
of parallel transport in flat space and generalize this mathematical condition to curved space. Now, if 
we parallel-transport a vector v along a t-parameterized curve in a flat space!®°! then we should keep 
the magnitude and direction of the vector constant along the path and this condition (i.e. keeping the 
magnitude and direction constant) is given mathematically by oe = 0, ie. the total derivative of the 
vector with respect to ¢ is identically zero along the path of transport since vanishing total derivative 
is equivalent to having a constant vector along the path. As we know from tensor differentiation (see § 
2.7), the total derivative along a curve in flat space becomes an absolute derivative in curved space. |91] 


Therefore, the generalization of the condition wy = 0 is simply done by replacing the total derivative 
with the absolute derivative, that is ~ = 0, i.e. the absolute derivative of the vector with respect to t is 


identically zero along the path of transport.|9?! 
In the following points we list a number of properties of parallel transport:!9# 


88 
89 


Parallel transport may also be called parallel propagation or parallel translation. 

In fact, the movement in infinitesimal steps is like the movement along straight segments since the curve is locally (or 
infinitesimally) straight. This should be supported by the fact that the space is supposed to be Riemannian and hence 
it is locally flat. 


90] We are assuming that we are using a Cartesian system which should be possible because the space is flat. 


91] In fact, becoming an absolute derivative in curved space is a requirement of the curvilinear nature of the coordinate 


system (since curved space must be coordinated by curvilinear system) and hence this “becoming” is also required in flat 
space if it is coordinated by a curvilinear system (as will be clarified later). 

92] Referring to the previous footnote, this generalization (i.e. replacing total derivative with absolute derivative) is needed 
for curvilinear systems in general even if the space is flat. The above description is essentially based, for the sake of 


simplicity, on employing a Cartesian (or affine) system to coordinate the flat space (as indicated in footnote [90] ). 
93 


These properties are general considering the inclusion of curved spaces. 
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S000 


10. 


11. 


. The curve along which parallel transport takes place should be differentiable. However, it is sufficient 


to be piecewise differentiable. 

Parallel transport is path dependent. Hence, given two points P, and P 2 in a space, the vector obtained 
at Py by parallel-transporting a vector from P; along a given curve C’ connecting P; to P, depends on 
the curve C. 

Because parallel transport is path dependent then starting from a given point P on a closed curve C, 
parallel transport of a vector around C' does not necessarily result in the same vector when arriving 
back at P. In fact, in curved space the final vector is generally different from the initial vector. 
Parallel transport of a vector field along a curve is unique. Hence, knowing a vector field at a given 
point on a given curve determines its parallel transport at all other points on the curve. 

The unit tangent vector of a geodesic curve is a parallel-transported field along the curve. In fact, this 
may be used as a definition of geodesic as “a curve that parallel-transports its tangent vector” or “a 
curve with a tangent vector field that is parallel-transported along the curve”. 

The magnitude of a parallel-transported vector field along a curve is constant. 

The angle between two parallel-transported vector fields along a given curve is constant. 

The inner product of two parallel-transported vectors along a given curve is constant. 

Parallel transport is field dependent. Hence, a parallel-transported vector field along a given curve is 
different (at each point on the curve) from another parallel-transported vector field along that curve. 
Parallel transport provides an indicator and measure of the curvature of space. As indicated earlier, in 
flat space a parallel-transported vector around a closed path returns to its starting point in the same 
direction at the starting point, while in curved space it generally (but not necessarily) returns to its 
starting point in a different direction to that at the starting point. In fact, the difference between its 
initial direction and its final direction can be used to quantify the intrinsic local curvature of the space. 
We note that for 2D space the angle between the initial and final vectors in this situation is a measure 
of the Gaussian curvature. 

Although parallel transport is initially defined for vectors (i.e. rank-1 tensors), it can be easily gen- 
eralized to include tensors of higher rank and hence we can talk about parallel transport of a rank-n 
tensor along a given curve. 


Problems 


Li; 


Justify the fact that the curve along which parallel transport takes place should be differentiable. 
Answer: This condition is required for parallel transport to be defined sensibly because the math- 


ematical condition for parallel transport is ~ = 0 where ~ = vie with v.; being the covariant 
derivative of the vector while ae (which is the tangent vector to the curve) is the derivative of the 


curve x'(t). Hence, the curve should be differentiable. 

Justify the fact that parallel transport of a vector field along a given curve is unique. 

Answer: Since a parallel-transported vector is constant in an intrinsic sense (i.e. constant in magni- 
tude and direction considering the intrinsic curvature of the space along the path of transport), then 
determining the vector at any point on the curve is sufficient for fully determining the vector at any 
other point on the curve because it is constant along the curve. 


Exercises 


Pwr 


ay 


Give a brief definition of parallel transport considering both flat space and curved space. 

Explain how the definition of parallel transport of a vector along a curve is formally implemented. 
Justify the fact that parallel transport is path dependent. 

Justify the fact that starting from a given point P on a closed curve C, parallel-transporting a vector 
field around C' does not necessarily result in the same vector field when arriving back at P. 

Justify, qualitatively, the fact that the unit tangent vector of a geodesic curve is a parallel-transported 
field along the curve. 

Show, formally, that the magnitude of a parallel-transported vector field along a curve is constant. 
Show that the angle between two parallel-transported vector fields along a given curve is constant. 
What you conclude from the last two exercises? 
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9. Show that the inner product of two parallel-transported vectors along a given curve is constant. 

10. Justify the fact that parallel transport is field dependent.|%4! 

11. Justify the fact that parallel transport provides an indicator of the curvature of space (i.e. in flat space 
a parallel-transported vector around a closed path returns to its starting point in the same direction as 
that at the starting point, while in curved space it generally returns to its starting point in a different 
direction to that at the starting point). 

12. Although parallel transport is primarily defined in relation to vectors, it can be extended to higher 
rank tensors. How? 


2.9 Geodesic Path 


In simple terms, geodesic path in curved space is the equivalent of straight line in flat space. Hence, 
geodesic path is commonly described as the most straight or most direct curve that connects two points 
in the space. This, of course, is not a technical definition and hence we still need to find a mathematically 
rigorous definition for geodesic path which will be developed later. However, as we will see the technical 
definition fundamentally rests on this generic definition, i.e. “the most straight curve”. 

Intrinsically, the geodesic paths are straight lines in the sense that an “intrinsic inhabitant” of the space 
to which they belong will see them straight since he cannot detect their curvature although they may be 
extrinsically curved as seen from an embedding space of higher dimensionality. This is due to the fact 
that only the intrinsic part of the curvature can be detected by the intrinsic inhabitant. Therefore, if this 
part vanishes the intrinsic inhabitant will fail to detect any curvature to the curve which is equivalent for 
him to having a straight line. Any deviation from such “straight lines” within the space is therefore an 
intrinsic curvature and hence it can be detected and measured by the intrinsic inhabitant. 

Although geodesic curve is frequently the curve of the shortest distance between two given points in the 
space (and hence geodesic may be defined as “the path of shortest distance”) it is not necessarily so. For 
instance, the largest of the two arcs forming a great circle on a sphere is a geodesic curve because it has no 
intrinsic (or geodesic) curvature but it is not the curve of the shortest distance on the sphere between its 
two end points. Accordingly, being the shortest path is a sufficient but not necessary condition for being 
a geodesic. A constraint may be imposed to make the criterion of minimal length apply to all geodesics 
by stating that geodesics minimize distance locally but not necessarily globally where an infinitesimal 
element of arc is considered in this constraint.!°°! In fact, we may also generalize the above definition of 
geodesic by replacing “shortest” with “optimal” or “extremal” (where being optimal or extremal is related 
to a given characteristic parameter of the curve not necessarily the length). It is noteworthy that being 
geodesic is independent of the choice of the coordinate system and hence it is invariant under permissible 
coordinate transformations. It is also independent of the type of representation and parameterization and 
hence it is invariant in this sense. This is because being geodesic is a real geometric property and hence 
it should not depend on conventional attributes such as coordinate system or type of representation and 
parameterization. 

To put the issue of geodesic paths in a proper context and to clarify many of its subtle aspects, we 
dedicate the next subsection to the issue of geodesic paths in 2D spaces or surfaces. These spaces are 
tangible and easy to visualize and analyze (by embedding them in the 3D Euclidean space that we live 
in) and hence we can get a beneficial insight into this issue with no need for highly abstract concepts 
and very sophisticated formal arguments that we usually need if we start this discussion from general nD 
spaces. However, this will serve as a case study and starting point for developing mathematically rigorous 
formal criteria for geodesic paths in general nD spaces. 


Problems 


1. List the different definitions of geodesic curve. 
Answer: There are three main definitions: (a) the most straight curve, (b) the curve of shortest 


194] This may sound trivial but the purpose of it is that two distinct vector fields that are parallel-transported along a given 
path will remain distinct at each point of the path. 
[95] Accordingly, a geodesic is made of infinitesimal segments each of which is of minimal length. 
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distance (or optimal length) and (c) the curve that parallel-transports its tangent. 
2. Give examples for geodesic paths in 2D, 3D and 4D spaces. 
Answer: The following are some examples: 
e 2D: arcs of great circles on sphere and meridians of surface of revolution. 
e 3D: straight lines in 3D Euclidean space. 
e 4D: trajectories (or world lines) of free particles and light rays in the Minkowskian spacetime of 
special relativity and in the Riemannian spacetime of general relativity. 

3. Give a physical interpretation for geodesic curves within the context of relativity theories (i.e. special 

and general). 
Answer: From the answer of the previous problem we can see that a physical interpretation for the 
geodesic curves is that they are the trajectories (or world lines) of free particles (whether massive or 
massless) in the spacetime (whether the spacetime is the flat Minkowskian space of special relativity 
or the curved Riemannian space of general relativity).!° 

4. What are the main types of geodesic in the spacetime of special and general relativity and how are 
they characterized formally? 

Answer: There are three main types: 

e Timelike geodesics which are possible world lines of free massive objects. They are characterized by 
positive spacetime quadratic form, i.e. (do)? > 0. 

e Lightlike (or null) geodesics which are possible world lines of free massless objects. They are char- 
acterized by vanishing spacetime quadratic form, i.e. (do)? = 0. 

e Spacelike geodesics which cannot be world lines of any physical objects (due to the restrictions on 
the speed). They are characterized by negative spacetime quadratic form, i.e. (do)? <0. 

We should note that the above classification of geodesics is based on the fact that the sign of the 
quadratic form (ds)* along any geodesic is constant (see next problem). 

5. Show that the nature of the geodesic path (i.e. being timelike, lightlike or spacelike) is preserved along 
the entire path, i.e. if a geodesic path is timelike/lightlike/spacelike at one of its points then it remains 
so along the entire path.!97 
Answer: Referring to § 1.3, a geodesic path with a tangent vector T is timelike/lightlike/spacelike 
according to whether T is timelike/lightlike/spacelike (i.e. T-T > 0, T-T =0, T-T < 0). Now, 
since geodesic path parallel-transports its tangent (see problem 1) and the inner product of parallel- 
transported vectors along a given curve is constant (see § 2.8) then T- T is constant and hence the 
nature of the geodesic path is preserved (i.e. constant) along the entire path. 

Note: in fact, the preservation of the nature of paths in spacetime is general for trajectories (or world 
lines) of real physical objects and is not restricted to geodesics. The reason is that according to the 
relativity theories massive objects always follow timelike trajectories and massless objects always follow 
lightlike trajectories while no physical object (whether massive or massless) can follow (even partly) 
spacelike trajectories. In fact, this can be inferred from the light cone structure where the trajectory of 
massive object is confined to inside the cone (which should be timelike) while the trajectory of massless 
object is confined to the cone itself (which should be lightlike). 

Accordingly, the above formal argument can be replaced by a simple physical argument, i.e. free 
massive/massless objects should always follow timelike /lightlike geodesics because they cannot change 
their nature (from massive to massless or from massless to massive),!°*! moreover they cannot follow 
a spacelike trajectory in any part of their geodesic trajectories. 


Exercises 


1. Discuss the idea of being optimal or extremal (as a criterion for geodesics) in the relativity theories. 
2. Compare between timelike geodesics and null geodesics within the framework of relativity theories. 


196] Tn fact, this applies to the timelike and null geodesics (see next problem). 

1971 This question may be phrased more simply as: show that the sign of the quadratic form (ds)? along any geodesic is 
constant. 

198] This is not inconsistent with the mass-energy conversion because in this case we have two different objects rather than 
a single object of two natures. 


2.9.1 Geodesic Path in 2D Spaces 70 


3. What is the difference between special relativity and general relativity with regard to geodesics and 
null geodesics? 

4. Give a brief characterization of geodesic path in Euclidean and Riemannian spaces. 

5. Why is the concept of gravitational force seen redundant in general relativity? What is the implication 
of this? How can this explain the correspondence between special relativity and general relativity? 

6. Compare between classical gravity and general relativistic gravity considering the concepts of mass 
and geodesic paths. 

7. Can the paradigm of “geodesic” replace the paradigm of “gravitational force” entirely? 

8. Is the paradigm of geodesic in general relativity a novelty of this theory? 


2.9.1 Geodesic Path in 2D Spaces 


Let discuss the issue of geodesic path in 2D spaces (i.e. surfaces) which are easy to envisage and hence 
we can get intuitive appreciation of the idea of geodesic and how it is formalized and quantified.!99! The 
curvature of a surface curve at a given point is represented by the curvature vector K which is given by: 


K=K,+K, =k,n+kgu (61) 


where K,, and K, are the normal and geodesic components of the curvature vector K, n and u are the 
normal unit vector to the surface and the geodesic unit vector, and Ky, and Kg are the normal and geodesic 
curvatures of the curve at the given point. As we know, the geodesic curvature is intrinsic to the surface 
while the normal curvature is extrinsic and hence it belongs to the embedding space. 

Now, based on our basic definition (or criterion) of geodesic curve as “the most straight path” we need 
to minimize (or rather eliminate) the curvature of the curve. Ideally, we want to have zero curvature (i.e. 
zero normal curvature and zero geodesic curvature) but from within the surface (i.e. intrinsically) this 
is achievable only for geodesic curvature because normal curvature belongs to the embedding space as it 
originates from the curvature of the surface itself within the embedding space. In brief, as long as we have 
a given surface that is curved within an embedding space and we have no access or control over the surface 
curvature then all we hope to minimize the curvature of a curve embedded into this surface is to eliminate 
the geodesic curvature which is intrinsic to the surface and hence it does not require access or control over 
the surface and its state within the embedding space. So, to have most straight path (i.e. geodesic) the 
geodesic curvature should vanish and this is the condition for the geodesic path in 2D space (or surface). 
Accordingly, if we want to generalize the concept of geodesic to spaces of higher dimensionality we need 
to impose the same condition, i.e. the intrinsic curvature (which is equivalent to the geodesic curvature 
in 2D space) must vanish. 

Now, let see how we can achieve this condition (i.e. vanishing of geodesic curvature identically) in 2D 
space. In this regard, we use our knowledge of calculus, specifically the fact that if something vanishes 
identically (i.e. not accidentally) then it can represent the rate of change (or derivative) of a constant 
entity. So, what is the constant entity that we should look for in this context (i.e. minimizing the 
curvature). Intuitively, curvature means twisting or changing direction, so the constant entity is ideally 
represented by the unit tangent vector T of the curve since this vector solely represents the direction of 
the curve with no involvement of other factors like magnitude. In other words, to have a geodesic path 
whose intrinsic curvature is zero because it does not twist (or curve) intrinsically, the derivative of the 
unit tangent vector T, which represents the direction of the geodesic curve, should vanish because T 
(considering the inherent curvature of the 2D space itself) is constant. i.e. T exactly follows the inherent 
curvature of the 2D space with no twisting or deviation from this inherent curvature. !!00 

The last thing that we need to address is: what kind of derivative (e.g. partial, total, covariant, absolute, 
etc.) should we use to achieve our objective? The answer is that: as long as we are dealing with an intrinsic 
issue (i.e. intrinsic curvature) within a potentially curved space then we need an intrinsic derivative that 
detects the rate of change of something in the space intrinsically and only within the space itself. Now, 


199] The reader is advised to consult B2 and B2X for more details. 
[109] The analog of this is straight line in a plane since this line follows exactly the inherent flatness of its space. 
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from our knowledge of tensor differentiation (see § 2.7) we can easily identify the absolute derivative 
(which is also known as intrinsic derivative for this reason) as the appropriate derivative for achieving our 
goal. As we know from the formulae of absolute derivative (e.g iA = a + A® hj de ), this derivative 
depends only on the surface coordinates and the Christoffel symbols and hence it is purely intrinsic to the 
surface. We should also note that the absolute derivative is defined along a curve (or path in space) and 
hence it is the pertinent derivative from this perspective. 

To sum up, to have a “most straight path” in a given curved surface the curvature of the path should be 
eliminated. Since the curvature of the path consists of intrinsic part and extrinsic part and because the 
extrinsic part belongs to the surface itself (considering its existence in the embedding 3D space) which we 
cannot change, then all we can (and should) do is to eliminate the intrinsic part. So, the characteristic 
feature of geodesic path is that its intrinsic (or geodesic) curvature is identically zero. Now, the intrinsic 
curvature is demonstrated by the twisting (say “right” and “left”) of the curve within the surface and 
this twisting is measured by the direction of the curve which is represented by the unit tangent vector 
T. Accordingly, the elimination of this intrinsic twisting (or curving) is equivalent to making the rate of 
change of T vanish, and because we are dealing with an intrinsic issue in a presumably curved space this 
rate of change should be intrinsic and of invariant (or tensorial) nature, i.e. absolute derivative (which 
is also the right type of derivative from the perspective of being defined over a curve). So, the required 
criterion for determining the geodesic path is that the intrinsic derivative of its unit tangent vector should 
vanish identically, that is:!!°! 


a Pines B oy 
6 (S jee dx? dx = (62) 


eure! De * Pay a 


where X is a natural (or affine) parameter of the curve, the indexed x are the coordinates of the curve, 
Ij, symbolizes the Christoffel symbols of the surface, and the standard notation of absolute derivative is 
in use (see § 2.7). The demand for the parameter to be natural arises from the fact that T = dr/dX with 
r(A) being the spatial representation of the curve and ) is a natural parameter.|!°?! 

The above discussion and arguments about the nature and criterion of geodesic paths in 2D spaces (or 
surfaces) can be easily and intuitively generalized to geodesic paths in spaces of higher dimensionality and 
hence we obtain the same formal criterion (as given by Eq. 62) for characterizing and identifying geodesic 
paths in any space. In the following three subsections we will exploit the insight that we gained from 
the 2D case study to rigorously characterize geodesic curves and find the required formal condition for a 
curve to be geodesic in nD space using its different fundamental definitions (i.e. “straightest”, “shortest” 
and “parallel-transporting its tangent”). As we will see, all these conditions are essentially based on the 
same fundamental principles and arguments that we discussed in this subsection and hence they can be 
seen as different demonstrations for a single geometrical principle on which the concept of geodesic rests, 
and this should explain why all these criteria are equivalent!!°3] as they lead to the same mathematical 
geodesic condition. 


Problems 


1. What natural and affine parameters mean? 
Answer: Natural parameter means that the parameter is curve length s while affine parameter means 
that the parameter is proportional to curve length, ic. 4 = as where a is a non-zero constant.|!94! 
Hence, affine parameter is more general than natural parameter. We should also note that “natural 
parameter” may be used to mean affine parameter (and vice versa) where the context should be 
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We follow in indexing the coordinates with Greek letters our convention in B2. 

The relation T = dr/d) is fully justified in the textbooks of differential geometry (see for example B2 and B2X; also 
see Exercises). We note that in B2 and B2X we use s instead of \. The shift in the present book from s to A is because 
X is commonly used in the books of general relativity to represent affine parameter (which is more general than natural 
parameter; see Problems) and hence it is more appropriate to use in this book. 

In fact, this equivalence applies in torsion-free spaces; otherwise there are details that should be sought in the extended 
literature of this subject. 

In fact, to be more general we should have \ = as + b where b is a constant. However, b is rather arbitrary and hence 
it can be set to zero and discarded with no loss of generality. 
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consulted to determine the appropriate meaning. We remark that in the relativity theories s is replaced 
by o (where for non-timelike paths its meaning is generalized). 


Exercises 


1. Investigate potential difficulties in the definitions of natural and affine parameters as given in the above 
Problem especially in the context of general relativity. 

2. Show that the magnitude of the tangent vector of a curve is constant when the curve is parameterized 
with natural or affine parameter. 


2.9.2 Geodesic Equation from Straightness 


The purpose of this subsection is to obtain the geodesic condition from the basic definition of geodesic as 
the most straight path. In fact, this was thoroughly investigated in § 2.9.1 within the context of 2D spaces 
and we concluded there that the generalization to spaces of higher dimensionality is straightforward since 
the dimensionality of the space is irrelevant to the fundamental geometric principles which the obtained 
geodesic condition rests upon. Accordingly, the geodesic condition from the basic definition of geodesic 
as the most straight path is given by Eq. 62 where a, 3,7 = 1,--- ,n (with n being the space dimension). 


Exercises 


1. Write the geodesic equation in a compact tensorial form. 


2.9.3. Geodesic Equation from Variational Principle 


The purpose of this subsection is to obtain the geodesic condition from the basic definition of geodesic 
as the shortest (or rather the optimal) path using the variational principle. In brief, a sufficient and 
necessary condition for a curve to be geodesic is that the first variation of its length is zero. Figure 1 is an 
illustration of how the length of the geodesic curve between two given points is subject to the variational 
principle. In fact, this condition may be taken as the basis for the definition of geodesic as the curve 
that connects two fixed points, P; and P2, whose length possesses a stationary value with regard to small 
variations in its neighborhood, that is:!1°l 


P2 
5] ds=0 (63) 
Py 


It is shown in the literature of calculus of variations that this condition is equivalent to the condition 
of being a solution to the Euler-Lagrange equation (see § 2.4) and hence being a solution to the Euler- 
Lagrange equation is a necessary and sufficient condition for stationarizing the arc length. Accordingly, 
all we need for obtaining the geodesic condition from the variational principle is to show that the geodesic 
condition can be obtained from the Euler-Lagrange equation, and that is what we will do in the Problems 
where we show that the geodesic equation is a solution to the Euler-Lagrange equation (i.e. Eq. 42). 


Problems 


1. Derive the geodesic equation from the Euler-Lagrange equation. 
Answer: According to the variational definition of geodesic curve (i.e. the curve of shortest or optimal 
length), the geodesic curve should extremize the length which is given by the integral [ ds. So, the 


variational principle is given by: 
ds 
= —d\= 4 
5 fas 5 f Gar=o (64) 


where 4 is a parameter for the geodesic curve. Now, the Euler-Lagrange equation is given in its general 
form by Eq. 42 where in our case: 


x = xX 


[105] Im other words, the geodesic is the optimal (in length) of all curves (connecting P; and P2) that are obtained by small 
perturbations in that neighborhood. 
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Figure 1: The length of a geodesic curve (solid) connecting two points, P,; and P:, as an extremum in 
comparison to the length of other curves (dashed) connecting these points where these curves are the 
result of small perturbations in its neighborhood. 
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Hence, the Euler-Lagrange equation becomes: 


OV git) d (OrV/gin'ti\ _ é (65) 
Oxk dy Ok - 
where the overdot represents derivative with respect to (i.e. Jy). 
Now, we have: - 
OVS Git) — (Ogi /Ox*) aa (66) 
= 


Ox 24/ gigk' tI 
where the power and chain rules (or composite rule) of differentiation (as well as the product rule) 
are used noting that <* and #/ (which represent the tangent) are independent of «* and hence 0,4 = 
Opt? = 0 (see exercise 2). 

Also: 


Og _— guj (O48 [08*) #7 + gust! (044/00) 


Oak 2 gigk' tI 


Gig 0) + gig" 5} 
2/G;j4°F 

945 t? + Gin 

Gj? + Ong? 


2 gig k' &I 


G58" (67) 
where in line 1 we use the power, chain and product rules of differentiation noting that g,;; is independent 
of &* (see exercise 3), in line 2 we use the fact that the coordinates are mutually independent and hence 
dx /dz* = 64 and O4) /d%* = 6;,!°5 in line 3 we use the Kronecker delta as an index replacement 
operator, and in line 4 we relabel a dummy index and use the symmetry of the metric tensor. Hence: 


dX On 7 VJ gig a5 


dn 
[106] This is like the well-known identity: Ox*/Oa = o (see exercise 4). 
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= dX 
Gig KEI 
ee Ce grit) (gist?) 
7 V Gigt' I a ae Dx (68) 
— gigt' I 


where in line 1 we substitute from Eq. 67, in line 2 we use the quotient rule of differentiation, and in 
line 3 we carry out differentiation in the second term (using the composite rule). 
On substituting from Eqs. 66 and 68 into Eq. 65 we get: 


<7 d(gnj%?) grit) A(gige*") 
(Ogi; /Ae*) stad V 958"? oH aa 


2y/ gig? LI 0 
2V/ Gig tI Giga 7 
wg Agni t? . d( gia?) 
Oxk Unrated 
2915 wins ot (9xi%") 4 gujt? A(gijt't?) 0 
d (9x57) 09:5 wigs grit) d (gist* i?) = 0 (69) 
dr Oxk gig tas dX 


where in line 2 we multiply with 2\/g,;#'47 and in lines 3 and 4 we simplify and carry out basic 
algebraic operations. Now: 


gus) = Bhi ga + 2 vee 
ri te ma #8! + 29K; ~ 
2 29h 5 ae Sap: a 


where in line 1 we use the product rule, in line 2 we use the chain rule in the first term, and in line 


4 we exchange the dummy indices ij and use the symmetry of the metric tensor in the second term. 
Accordingly, Eq. 69 becomes: 


O9k5 inj, OGik ij dil giz .4.5 — gugt? A(gijt*t?) 
<a' 4) 4 —'¢) + 29K; 2S pb oJ seca 28 
Axi LL Feu Gkj ah ark zr ijt DN 
O9K; OGik Ogi; iJ did Gnjt? d (gijt' a?) 
ea : 29%3 — = 6 
( dat * Bet dae J” © “Say Gig LHI dA 


Now, if we impose the condition that A is a natural (or affine) parameter then g;;@'¢! (which is the 
inner product of the tangent vector to the geodesic curve by itself and hence it represents the square 


of the magnitude of the tangent vector) is constant (see § 2.9.1) and hence the last term is zero, that 
is: 


Ogny , O9ik = OGiG \ «4-5 did 
Ox* Oxi Oxk 

1 (Ogn; , OG:R = O93 \ 24-5 did 

2 (4 + Oni — ce )™ 
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where in line 2 we divide by 2, in line 3 we multiply by g™”, in line 4 we use the fact that g’™* and Iki 
are inverses, in line 5 we use the Kronecker delta for index replacement, in line 6 we use the expression 
of Christoffel symbol, and in line 7 we use our convention that the overdot represents derivative with 
respect to A. The last equation is the geodesic equation as given earlier (with some trivial notational 
differences). 


Exercises 


1. State the mathematical condition for the variational principle in the spacetime of general relativity 
and interpret this condition in the case of timelike trajectory. 

2. Justify the following relation: 0,%' = 0 (as well as O,@ = 0). 

Justify the claim that g;; is independent of ar. 

4. Justify the following relation: 04'/Oz* = 61. 


2.9.4 Geodesic Equation from Parallel Transport 


In this subsection we discuss how to obtain the geodesic equation from the basic definition of geodesic 
as the curve that parallel-transports its own tangent. According to our previous investigation, parallel 
transport in curved space is the generalization of parallelism in flat space, while geodesic curve in curved 
space is the generalization of straight line in flat space. So, we need to make the analogy between the 
ideas of parallel transport and geodesic curve in curved space and the corresponding ideas of parallelism 
and straight line in flat space very clear. Now, in flat space straight line is characterized by having the 
same direction at any point, i.e. its tangent vector is constant and hence it remains parallel to itself as 
it moves along the line. So, to generalize the concept of “straight in flat space” to “geodesic in curved 
space” we can define geodesic as the curve whose tangential direction is “constant” with regard to parallel 
transport. This means that the tangent to a geodesic curve remains tangent to the curve when this 
tangent is parallel-transported along the curve and hence the geodesic curve in curved space has “fixed” 
or “constant” direction like straight line in flat space. 
Accordingly, the required condition for a curve to “parallel-transport its own tangent” is that the absolute 
derivative of the tangent vector is identically zero along the curve, that is: 
Of ame Bg pry aa 


= haga ie rea 
ot ae KI dt 


=0 (70) 


where the indexed T is the unit tangent vector to the curve while ¢ is a parameter for the curve. Now, 
if ¢ represents a natural parameter A of the curve (ie. ¢ = A) then the unit tangent vector is given by 


a de [107] On substituting this into the last equation we get: 


0 (71) 


d (dx . dak ; dxi 
d\ \ dX dy 3 dX 


1107] Refer to B2 and B2X for details. Also see § 2.9.1. We note that the same geodesic equation will be obtained if the 
parameter is affine (and hence T represents a tangent vector of constant magnitude). 
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which is the geodesic equation that we obtained earlier from other criteria and methods. So, the geodesic 
equation is a mathematical expression of the fact that the absolute derivative of the unit tangent vector of 
a geodesic curve vanishes identically where the “vanishes identically” condition comes from the condition 
of parallel transport. 

We note that in pseudo-Riemannian spaces (like the spacetime of general relativity) the line element 
do can be zero or imaginary and hence the definition of geodesic as “the path of shortest distance” could 
be questioned.!4°%! However, this may be avoided by the definition of geodesic that is based on the 
concept of parallel transport!!! although this can also be questioned. Anyway, we consider these cases 
as generalizations of the concept of geodesic, regardless of the adopted definition of geodesic, and hence 
there should be no problem in extending the concept and criteria of geodesic to pseudo-Riemannian spaces. 


Problems 


1. Outline the essence and objective of the last three subsections (i.e. § 2.9.2, § 2.9.3 and § 2.9.4). 
Answer: The essence of the last three subsections is the investigation of three conditions (based 
on the three definitions of geodesic) for characterizing the geodesic curve in general nD space. The 
objective of this investigation is to show that these three conditions are equivalent since they lead 
to the same mathematical condition for the geodesic path, i.e. the condition given by the equation: 
d?x* | Tt dx) dx” = 0 
da? '* jk dX dX ~~" 

Exercises 


1. State the mathematical condition for a vector field A’ to be parallel-transported along a given curve. 

2. Briefly discuss the qualification of geodesic curve as “parallel-transporting its own tangent”. 

3. Show, qualitatively, that the geodesic criterion of “most straight” and the geodesic criterion of “parallel- 
transporting its tangent” are equivalent. 

4. Justify the fact that in parallel transport of the tangent vector along a geodesic curve the parameter 
of the curve should be natural (or affine). 

5. Show that the geodesic equation can be written as ae = 50; gjx1/T* (where the indexed T is the unit 
tangent vector and A is a natural parameter). 


2.9.5 Geodesic Equation 


Based on our investigation in the previous subsections, we conclude that the necessary and sufficient 
condition for a naturally parameterized curve in an nD space to be geodesic is given by the following 
tensorial second order non-linear differential equation: 


pre =0 (73) 


where \ is a natural (or affine) parameter of the curve, the indexed x are the coordinates of the curve, 
I'3., represents the Christoffel symbols of the space, and a, (,y range over space dimensions (which for 
the spacetime of relativity theories become a, 3,7 = 0,1,2,3 according to our indexing convention). 
Accordingly, all the A-parameterized curves «“(X) that satisfy the above equation are geodesic paths in 
the space. So, what we need in our search for geodesic paths is to find the solutions of the above equation. 
However, because of its non-linear and tensorial nature this equation in most cases has no closed-form 
explicit solutions. 

From Eq. 73, it can be seen that being a geodesic is an intrinsic property because this equation depends 
exclusively on the Christoffel symbols which depend only on the metric tensor and its partial derivatives 
(see § 2.6). Hence, geodesic curves can be detected and identified by an intrinsic inhabitant of the space. 


1108] Th fact, this questioning may even include the definition of geodesic as “the most straight path”. 
[109] This may be justified by the claim that parallel transport is based on an abstract analytical condition although it is 
originally obtained from an intuitive generalization of parallelism in Euclidean space. 
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From Eq. 73, it can also be seen that for any flat space coordinated by a Cartesian system the geodesics 
are straight lines since in this case the Christoffel symbols vanish identically and Eq. 73 becomes ce =0 
which has a straight line solution. This should apply even if the flat space is coordinated by other types 
of coordinate system due to the invariance of geodesic (since it is a real geometric property) and hence 
geodesics in flat space are straight lines in general regardless of the employed coordinate system. We 
should finally note that the geodesic equation has only one free index and hence it is a vector equation. 
Therefore, in an nD space the geodesic equation represents n component equations (which in the case of 
the spacetime of relativity theories is 4). 


Problems 


1. Show that the affine parameter A in the geodesic equation can be replaced by the proper time parameter 
7 (i.e. the proper time parameter is an affine parameter). 
Answer: First, we should remark that in the relativity theories this should be restricted to timelike 
geodesics where r > 0 (i.e. geodesics of massive objects). The reason is that for lightlike geodesics (i.e. 
geodesics of massless objects) we have r = 0 and hence 7 is not a useful parameter, while for spacelike 
geodesics the proper time parameter is physically meaningless. 
Now, the geodesic equation is given by: 

da? dx? 
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where A is an affine parameter. Using the chain rule of differentiation, the above equation can be 
written as (see the upcoming note 1): 


d2x® (dr\? a» ax? dx? (dr 7 
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where 7 is the proper time parameter. Now, since: 
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then: 
dx a ax? dx? 
——— + a 
dr? BY dr dr 
which is the geodesic equation with the affine parameter being replaced by the proper time parameter 
T. 
Note 1: we have: 


aX MESS, = PEO da Or da dr _ dx* d [dr]\ dr 
a2 d\\ d\ ) dr\ dr d\/ a dr2 d\ dr dr |dXd|/) d\ 


Now, since 2 is constant (according to Eq. 74 since a and c are constants) then 4 [| = 0 and 


dX dt Ldx 
hence: 


=0 (75) 


dd? dr? \dv 
We should remark that since we are dealing with the spacetime of general relativity then we replace 
s (which is the space interval) with o (which is the spacetime interval) and hence the relation \ = as 
becomes 4 = ac. We also use the relation between 7 and a, i.e. T= a/c. 
Note 2: because U® = dx*/dr (where U® is the velocity 4-vector), Eq. 75 (which is the geodesic 
equation in terms of the proper time parameter) may be written as: 
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dU 
dt 


+1T3,U%UY = 0 


. . 6U _ 
The last equation may be written compactly as 5= = 0. 
Exercises 


1. List the necessary and sufficient conditions for a curve to be geodesic. 

2. What is the difference between the geodesics of special relativity and the geodesics of general relativity? 
Try to justify your answer technically. 

3. In the last four sections (i.e. § 2.6, § 2.7, § 2.8 and § 2.9) we investigated the topics of Christoffel 
symbols, absolute derivative, parallel transport and geodesic path. Try to investigate the relation 
between these topics and how they are linked together. Also, outline how this leads to the derivation 
of the geodesic equation. 

4. Show that in the Minkowski spacetime the geodesics of massive objects are described by the conditions: 
bas = 0 and ee = 0. What this means? 

5. Why the tensorial nature of the geodesic equation complicates its solution? 

6. Justify the invariance of geodesic paths across various coordinate systems and its independence of the 
type of representation and parameterization of the space and curve. 

7. What is the relation between the tensorial geodesic equation and the geodesic trajectory of an object 
(massive or massless) in spacetime? 

8. The motion of free objects (i.e. free of forces other than gravity) is subject to the geodesic equation. 
How will this change when the objects are not free (i.e. they are subject to non-gravitational forces)? 


2.10 Riemann-Christoffel Curvature Tensor 


This is a rank-4 tensor that characterizes the intrinsic curvature of space and hence it plays an important 
role in non-Euclidean geometries and their applications like general relativity. The tensor is used, for 
instance, to test for the intrinsic curvature of space and as a building block for the Einstein tensor which 
represents the geometric component of the Field Equation (see § 2.13 and § 3.2). As indicated before, 
the Riemann-Christoffel curvature tensor vanishes identically iff the space is globally flat and hence the 
Riemann-Christoffel curvature tensor is zero in Euclidean spaces and quasi-Euclidean spaces like the 
Minkowski spacetime of special relativity. The Riemann-Christoffel curvature tensor depends only on the 
metric which, in general coordinate systems, is a function of position and hence the Riemann-Christoffel 
curvature tensor follows this dependency on position. Yes, for affine coordinate systems the metric tensor 
is constant and hence the Riemann-Christoffel curvature tensor vanishes identically throughout the space, 
i.e. it is independent of position. 

There are two kinds of Riemann-Christoffel curvature tensor: first and second, where the first kind is 
a type (0,4) tensor while the second kind is a type (1,3) tensor.!4!°] The Riemann-Christoffel curvature 
tensor of the first kind, which may also be called the covariant (or totally covariant) Riemann-Christoffel 
curvature tensor, is given by: 


Rizk = Ox (Jl, t] — On (gk, t] + [el, 7] Vik — lik, 1] Vi (76) 
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where line 2 is based on Eq. 51,!'!!! while line 3 is based on Eq. 52. Similarly, the Riemann-Christoffel 
curvature tensor of the second kind, which may also be called the mixed Riemann-Christoffel curvature 


[110] Tn fact, these are the types that are in common use in the literature (due to their special significance in the formulation); 
otherwise the indices of the Riemann-Christoffel curvature tensor can in principle be shifted up and down (using the 
index shifting operators) like the indices of any tensor. 

[111] That is [i7, J = $ (O; 9:1 + Oigjt — O19%5) and hence 0, [ij, l] = 4 (On95 9:1 + OnOigjt — OnO1gij)- 
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tensor, is given by: 


Rint = OT, OW, ies M04, (77) 
The Riemann-Christoffel curvature tensor of the first and second kinds can be obtained from each other 
by the index shifting operator, that is: 


Re ix = g* Rajri and Rijn = Gia je (78) 


The covariant Riemann-Christoffel curvature tensor satisfies the following symmetric and anti-symmetric 
relations in its four indices: 


Bie = +Retiy (block symmetry) (79) 
= —Rjyiri (anti-symmetry in the first two indices) 
= —-Rijtz (anti-symmetry in the last two indices) 


The anti-symmetry property of the covariant Riemann-Christoffel curvature tensor with respect to the 
last two indices also applies to the mixed Riemann-Christoffel curvature tensor, that is: 


R ie — —R’ jy, (80) 


As a consequence of the two anti-symmetry properties of the covariant Riemann-Christoffel curvature 
tensor, the components of the tensor with identical values of the first two indices (e.g. Rjij~) or/and 
the last two indices (e.g. Rijx,) are zero. We remark that all the above symmetry and anti-symmetry 
properties of the Riemann-Christoffel curvature tensor of the first and second kinds can be proved by 
using the above mathematical expressions of this tensor.|!1?! 

In an nD space, the Riemann-Christoffel curvature tensor has n* components because it is a rank-4 
tensor. However, due to its symmetry and anti-symmetry properties (as well as the first Bianchi identity), 
the tensor has only wr) independent non-vanishing components.!!!%! For example, in 2D space the 
tensor has 24 = 16 components with only one independent non-vanishing component, while in 3D space the 
tensor has 34 = 81 components but only 6 of these are independent non-vanishing components. Similarly, 
in 4D space (such as the spacetime of general relativity) the tensor has 44 = 256 components but only 20 
of these are independent non-vanishing components. 

An important set of identities related to the Riemann-Christoffel curvature tensor and frequently used 
in the formulation of general relativity are the Bianchi identities. These identities, which are based on 
combinations of this tensor or its covariant derivative, are given by: 


4 


Rizr + Rizk + Rixtj = 0 (first Bianchi identity) (81) 
Rijkiim + Rijmkat + Rijime = 0 (second Bianchi identity) (82) 


These are called the Bianchi identities of the first kind. By raising the first index of the tensors in these 
identities the Bianchi identities of the second kind are obtained, that is: 


Rpt Rint Ray = 0 (83) 
R jet + FR ented ah Le stars = 0 (84) 
The first Bianchi identity as stated above is an instance of the pattern that by fixing the position of 


one of the four indices and permuting the other three indices cyclically, the sum of these three permuting 
forms is zero, that is:!1!41 


Rijn + Rage + Rikiy = 9 (i fixed) (85) 


112] See question 7.20 of B3X. 

113] See question 7.22 of B3X. We should note that the number of independent non-vanishing components (as given by the 
above formula) represents what is required by the symmetry and anti-symmetry properties; otherwise the actual number 
can be less than this (e.g. in flat spaces all the components are zero; also in certain locations in the space the number 
can be less than this). 

114] The first form of the first Bianchi identity is proved in § 7.2.2 of B3 while the fourth form is proved in question 7.28 of 
B3X. The other two forms can be easily proved following similar method. 
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Rizzi + Riyjik + Reju = 0 (j fixed) (86) 
Rijri + Rurng + Ryizi = 0 (k fixed) (87) 
Rizri + Rrijl Tr Ryril =0 (I fixed) (88) 


The pattern of the second Bianchi identity in its both kinds is also simple, that is the first two indices are 
fixed while the last three indices are cyclically permuted in the three terms. 


Problems 


1. Justify the fact that the Riemann-Christoffel curvature tensor is an intrinsic property of the space. 
Answer: This can be easily concluded from Eq. 76, for example, since the Riemann-Christoffel 
curvature tensor can be expressed in terms of the Christoffel symbols (and their derivatives) which are 
intrinsic attributes of the space (see § 2.6). 

2. Give an example of the involvement of the Riemann-Christoffel curvature tensor in tensor differentia- 
tion. 

Answer: The covariant differential operators in mixed derivatives are not commutative in general and 
hence for a covariant vector A and a contravariant vector B we have:!!15] 


Ae te An, cae Ae “POU i pi _ pi i 
Aj; Ajsik = Rj, Ai and Bo 1 By, = RiyB 


These equations show that the covariant differential operators are commutative iff the Riemann- 
Christoffel curvature tensor vanishes identically which means that the space is flat.!!!6 

3. Outline a proof for the statement: the space is flat iff the Riemann-Christoffel curvature tensor vanishes 
identically. 
Answer: First, we need to show that the Riemann-Christoffel curvature “tensor” is really a tensor. 
From the equation Aj.47 — Ajin = Ri Ai (which we stated in the previous problem) we see that 


Aj.k1 — Ajtk is a tensor (because it is the difference between two tensors) |117] and A; is a tensor (by 
assumption). Hence, by the quotient rule of tensors Rint is also a tensor. 

To prove the above statement, we have two parts: 

e If the space is flat then the Riemann-Christoffel curvature tensor vanishes identically: this is obvious 
because if the space is flat then it can be coordinated by a Cartesian (or pseudo-Cartesian) system 
where the Christoffel symbols vanish identically (see exercise 6 of § 2.6) and hence the Riemann- 
Christoffel curvature tensor also vanishes identically according to its definition (see Eqs. 76 and 77). 
Now, because it is a tensor then it should vanish in any other coordinate system (whether Cartesian 
or not) due to the value-invariance of the zero tensor (which was established in § 2.2). 

e If the Riemann-Christoffel curvature tensor vanishes identically then the space is flat: it is shown in 
the literature that if the Riemann-Christoffel curvature tensor vanishes identically then the space can 
be coordinated by a Cartesian (or pseudo-Cartesian) system and hence according to the definition and 
criterion of flat space (see § 1.3 and § 2.3) the space should be flat. 

Note: there are several approaches in the literature (such as the use of parallel transport or the use of 
commutativity of covariant derivatives) to prove the second part. However, in our view it may be more 


straightforward to prove this by using the geodesic deviation formula (i.e. eee + R%, D# ae dg = 0) 


Buy 
which is derived in § 7.13 where the vanishing of R%,/, leads to the vanishing of the second order 
62 D® 


absolute derivative of the vector D® (i.e. “S4- = 0) which should imply space flatness. 

4. Show that in rectilinear coordinate systems Rjj,) = 0. 
Answer: In rectilinear coordinate systems the components of the metric tensor are constants because 
the basis vectors are constants and hence all the Christoffel symbols of both kinds vanish identically 
(see exercises 6 and 7 of § 2.6). Hence, from Eq. 76 we get Rijn: = 0. 


Exercises 


[115] For more details, refer to § 5.2 of B3. 

[116] The commutativity should also apply in flat regions of space (though not identically over the entire space). 

[117] We note that the covariant derivative of a tensor is a tensor (see § 2.7) and hence the covariant derivative (of any order) 
of a tensor is a tensor (e.g. Aj;% is a tensor and hence its covariant derivative Aj;,1 is also a tensor). 
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1. What are the physical dimensions of the Riemann-Christoffel curvature tensor? 

2. Make a list of all the main properties of the Riemann-Christoffel curvature tensor (i.e. rank, type, 
symmetry, etc.). 

3. When the Riemann-Christoffel curvature tensor vanishes the covariant differential operators become 
commutative. Justify. 

4. Show that Rjjx~; = 0 if the Christoffel symbols vanish identically. What is the implication of this? 

Give a proof for the second Bianchi identity. 

6. The first Bianchi identity Rjj~. + Rij, + Riniy = 0 may be expressed as Rjj4~j) = 0 where the square 
brackets [] mean anti-symmetrization in the enclosed indices. Justify. 


Or 


2.11 Ricci Curvature Tensor 


The Ricci curvature tensor, which is a rank-2 tensor, is derived from the Riemann-Christoffel curvature 
tensor and hence it plays a similar role in characterizing the space and describing its curvature (although 
partly). There are two kinds of Ricci curvature tensor: first and second, where the first kind is a type 
(0,2) tensor while the second kind is a type (1,1) tensor.!!"8] The first kind is obtained by contracting the 
contravariant index of the Riemann-Christoffel curvature tensor of the second kind with its last covariant 
index, that is: 

Rij = Ria = OjV a _ Oa fj Tr Ee bi t. re, ha (89) 


where Eq. 77 is used to obtain this expression. As we see, the Ricci curvature tensor can be defined 
directly by the Christoffel symbols although it is originally based on the Riemann-Christoffel curvature 
tensor. 

Because Tj, = 0; (In /g),!449! the Ricci curvature tensor can also be written in the following forms as 
well as several other forms: 


Ri = 9;0; (In /G) — Oo + VHT 2. — 17; (In V9) (90) 
a 1 a 
= 00; (InV/g) + Fy ey, 0G (/9T 4) 
V9 
where g is the determinant of the covariant metric tensor. The Ricci curvature tensor of the first kind is 
symmetric, that is: 


Rij = Ry (91) 


This can be easily verified by exchanging the 7 and j indices in the last line of Eq. 90 taking account of 
the commutativity of partial differential operators and the fact that the Christoffel symbols are symmetric 
in their paired indices (Eq. 55).!!20l 

The Ricci curvature tensor of the first and second kinds can be obtained from each other by the index 
shifting operators, that is: 


Rag Ry and Ry = gn Rh (92) 
n(n+1) 


components but due to its symmetry only —— are 


In an nD space, the Ricci curvature tensor has n? 


independent (see § 2.2). 


Problems 


1118] In this we follow the literature where these two kinds may be considered as the main types; otherwise we will see in 
§ 2.13 that for the contravariant Einstein tensor G“” we need to define a contravariant Ricci curvature tensor R'”. 
Anyway, the Ricci curvature tensor is a tensor and hence in principle it can have contravariant form (as well as covariant 
and mixed forms) like other tensors. 

1119] See § 5.1 of B3. 

[120] See question 7.32 of B3X. 
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1. Obtain a mathematical expression for the mixed Ricci curvature tensor Ri, in terms of the Christoffel 
symbols and the metric tensor. 
Answer: We have: 


Ry = g Rey 
ik b b 
= 9 (OP ha — OVE; + Peal; — TRV Ea) 


where in line 1 we use Eq. 92 and in line 2 we use Eq. 89. 
Exercises 


1. What are the physical dimensions of the Ricci curvature tensor? 

2. What are the number of components and the number of independent components of the Ricci curvature 
tensor in the spacetime of general relativity? 

3. Make a list of the main properties of the Ricci curvature tensor (rank, type, symmetry, etc.). 

4. The first kind of the Ricci curvature tensor may also be obtained by contracting the contravariant 
index of the Riemann-Christoffel curvature tensor of the second kind with its covariant index before 
the last. Comment on this. 

5. Show that R,; = 0 identically is a necessary but not sufficient condition for a space to be flat. 


2.12 Ricci Curvature Scalar 


The Ricci curvature scalar R, which is also known as the curvature scalar and the curvature invariant, is 
the result of contracting the indices of the Ricci curvature tensor of the second kind, that is: 


R= @' Ry = Bi, (93) 


Problems 


1. Obtain a mathematical expression for the Ricci curvature scalar R in terms of the Christoffel symbols 
and metric tensor. 
Answer: We have: 


R= gi Rs 
g” (OG = ONY; A Te bi = ry, ba) 


where in line 1 we use Eq. 93 and in line 2 we use Eq. 89. 

2. How is the Ricci curvature scalar obtained from the covariant Riemann-Christoffel curvature tensor? 
Answer: It is obtained as follows: 
e The mixed Riemann-Christoffel curvature tensor is obtained by raising the first index of the covariant 
Riemann-Christoffel curvature tensor, that is: 


a ab 
Rijn =O Roigr 


e The covariant Ricci curvature tensor is obtained by contracting the contravariant index with the last 
covariant index of the mixed Riemann-Christoffel curvature tensor, that is: 
Rij = JER 5 = aT Me 

e The mixed Ricci curvature tensor is obtained by raising the first index of the covariant Ricci curvature 
tensor, that is: 

R* ae g” Rij 
e The Ricci curvature scalar R is obtained by contracting the indices of the mixed Ricci curvature 
tensor, that is: 

—_ sipk — pi 

R=6,R, =P, 
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Exercises 


1. What are the physical dimensions of the Ricci curvature scalar? 

Make a list of the main properties of the Ricci curvature scalar. 

Show that R = 0 identically is a necessary but not sufficient condition for a space to be flat. 
Compare the curvature information contained in the Riemann-Christoffel curvature tensor, the Ricci 
curvature tensor and the Ricci curvature scalar. 


Pwr 


2.13 Einstein Tensor 


The Einstein tensor G,, is a rank-2 tensor defined in terms of the Ricci curvature tensor R,,, the Ricci 
curvature scalar R and the metric tensor g,, as follows: 


1 
Gyp = Rw — pI kh (94) 


The tensor may also be given in contravariant and mixed forms, that is: 


Gh’ = REY — so" and Ge = Ri — -~8R (95) 
As we will see in § 3, the Einstein tensor represents the geometric part of the Field Equation while the 
energy-momentum tensor represents its physical part. 

Since both the Ricci curvature tensor and the metric tensor are symmetric, the Einstein tensor is also 
symmetric (see § 2.2). The divergencel!?!] of the Einstein tensor vanishes identically for any Riemannian 
metric, that is: 

Gh =90 (96) 


This can be easily proved by contracting the second Bianchi identity twice with the use of the anti- 
symmetry properties of the Riemann-Christoffel curvature tensor (see exercise 4). 


Problems 


1. Make a list of the main properties of the Einstein tensor. 
Answer: The main properties are: 
e It is synthesized from the Ricci curvature tensor, the Ricci curvature scalar and the metric tensor. 
e It is absolute rank-2 symmetric tensor and hence in nD space it has n? components and ale) 
independent components. Therefore, in the 4D spacetime of general relativity it has 16 components 
10 of which are independent. 
e It depends only on the metric tensor and hence it is intrinsic to the space. 
e It characterizes the geometry of space and expresses its curvature (partly). In fact, it represents the 
geometric part of the Field Equation of general relativity (see § 3.2). 
e It has the physical dimensions of reciprocal area. 
e Its divergence is zero identically. 
e It vanishes in flat space. 

2. Explain how to build the Einstein tensor from the metric tensor. 
Answer: We build it as follows: 
(a) We obtain the Christoffel symbols from the metric tensor (see Eqs. 51 and 52). 
(b) We obtain the Riemann-Christoffel curvature tensor from the Christoffel symbols (see Eq. 76). 
(c) We obtain the Ricci curvature tensor R,,, and the Ricci curvature scalar R from the Riemann- 
Christoffel curvature tensor and the metric tensor (see problem 2 of § 2.12). 
(d) We obtain the Einstein tensor from the Ricci curvature tensor, the Ricci curvature scalar and the 


[121] We note that the divergence in the textbooks of general relativity may be called covariant divergence or 4-divergence 
(or terms like these) to indicate its tensorial 4D nature and distinguish it from the ordinary divergence in 3D space 
coordinated by Cartesian system. However, we think this distinction is redundant because we are already assuming 4D 
spacetime coordinated by general coordinate system and hence divergence in this context is just a generalization of the 
ordinary divergence. 
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metric tensor using Eq. 94. 
Note: we may replace step (b) and step (c) with using Eqs. 89 and 93 without building the Riemann- 
Christoffel curvature tensor. 


Exercises 


1. Show that the physical dimensions of the Einstein tensor are reciprocal area. 

2. In flat space the Einstein tensor is zero. Explain why. 

3. What the identity G“”, = 0 means? 

4. Prove that the divergence of Einstein tensor is identically zero (i.e. G4”, = 0) using the second Bianchi 
identity. 

Show that g’°Gayv = 0. 

Show that G = —R where G is the trace of the Einstein tensor and R is the Ricci curvature scalar. 
Show that G,, =0 iff Ruy =0. 

Show the following: (a) G = 0 iff R=0 and (b) if G,, =0 then R=0. 

Compare the curvature information contained in the Riemann-Christoffel curvature tensor and the 
Einstein tensor. What is the implication of this? 


So GN HD OH 


2.14 Energy-Momentum Tensor 


The energy-momentum tensor T’”, which expresses the distribution and flow of energy and momentum 
in spacetime, is a rank-2 symmetric tensor and hence it has 16 components with only 10 of these being 
independent. These components are generally functions of spacetime coordinates and hence each point 
in spacetime is characterized by a specific value of the energy-momentum tensor. As we will see in § 
3.2, the energy-momentum tensor represents the physical part of the Field Equation while the Einstein 
tensor represents its geometric part.!!2?] We note that the energy-momentum tensor may also be called 
the stress-energy-momentum tensor or the stress-energy tensor. We should also remark that due to the 
mass-energy equivalence, according to the Poincare relation E = mc’, energy here includes mass since 
it is a form of energy. In the following discussion we generally assume an orthonormal 4D coordinate 
system whose all coordinates have physical dimensions of length. This should be valid at least locally in 
the Riemannian spacetime of general relativity. 

To determine the physical significance of the components of the energy-momentum tensor, let define 
this tensor in simple terms as the tensor whose T“” component represents the flux of the pi” component 
of the momentum 4-vector across a surface perpendicular to the v“” coordinate.!!?3!_ Now, “flux” in simple 
terms means “per unit area per unit time”. So, all we need to find the T“” component is to take the 
yu? component of the momentum 4-vector and divide it by the area of a surface element perpendicular 
to the v'” coordinate and by the time. Now, the surface element perpendicular to the v*” coordinate is 
defined by the product of two line elements of other two coordinates (i.e. other than the v*” coordinate). 
So, if we divide and multiply by a line element along the direction of the v‘” coordinate then the T“” 
component can be seen as the pi!” component of the momentum 4-vector divided by volume (i.e. the 
volume passing through the surface in a unit time) and multiplied by speed (i.e. the velocity component 
in the v‘” direction).!!741 So in brief, the component T“” of the energy-momentum tensor represents the 
density (i.e. per volume) of the y*” component of the momentum 4-vector times the v‘” component of 


[122] As we will see, the energy-momentum tensor may involve the metric and hence it may not be seen as purely physical. 

[123] We remind the reader that the momentum 4-vector is given by P = my (c, ul,u?, u®) where m is the invariant mass, 7 
is the Lorentz factor, c is the characteristic speed of light and (ul, u?,u?) is the velocity 3-vector. We also note that all 
the components of the coordinate (or position) 4-vector have the dimension of length (assuming certain type of spatial 
coordinates) since x° = ct (see B4). The reader should also be aware that expressions like “across a surface” are more 
general than being perpendicular or parallel to the surface since the surface can be tangential to the component of the 
momentum 4-vector (i.e. when the pt? and vt coordinates are perpendicular). Also, “surface perpendicular to the prh 
coordinate” means coordinate surface of constant v‘” coordinate. 

[124] The reader should note that this is a recipe (or pedagogical device) for synthesizing and analyzing the energy-momentum 
tensor rather than a depiction and reflection of the actual physical situation and process (in fact we mainly consider the 
case in which the p*” and v*” coordinates are identical). The validity of this approach will be verified in the discussion 
of the examples of the energy-momentum tensor in the questions. 
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Figure 2: The general structure of the energy-momentum tensor showing the positions of the components 
of energy density (ED), momentum flow (MF), spatial shear stress (SS) and spatial normal stress (NS). 


the velocity 4-vector (but without the 7 factor).!!25] Accordingly, the physical dimensions of this tensor 
are energy per volume (e.g. J/m?). 

To be more clear, let state the above description in a more practical and definite form considering a 
simple case. The component T° represents the energy density at the point of spacetime represented by 
the tensor while the components T” = T’® (i = 1,2,3) represent the temporal rate of energy flow per unit 
area perpendicular to the i*” direction divided by the speed of light c (the division by c is because the 
zeroth component of the 4-momentum is E/c). The components T” (i, 7 = 1,2,3) represent the temporal 
rate of flow of the i” component of momentum per unit area perpendicular to the j*” direction. In more 
simple (and rather different) terms, in orthonormal systems T°? represents energy density, T°’ = T’° 
represents momentum flow,!!?6] and T“/ represents spatial stress (see Figure 2). 

As indicated earlier and will be detailed later (refer to § 3.2), the energy-momentum tensor represents the 
physical part in the Field Equation (in contrast to the geometrical part that is represented by the Einstein 
tensor which is based on the space metric) since it mathematically portrays the distribution and flow of 
matter and energy in the “physical” spacetime. Accordingly, it is the source term in the gravitational field 
formulation of general relativity like mass density p in the gravitational field formulation of the classical 
theory (see Eq. 14 and refer to § 3.2). A characteristic property of the energy-momentum tensor is that 
its divergence is identically zero, that is: 

Th, =0 (97) 


The vanishing of divergence is a requirement for the conservation of energy-momentum as represented by 
the momentum 4-vector (see exercises 8, 9 and 11). 

The computation of the energy-momentum tensor in real physical situations is generally a complicated 
process if it is viable at all. However, in the literature of general relativity there are several simple 
examples that are used as prototypes to illustrate the structure, physical significance and method of 
computation of this tensor (in fact these prototypes are commonly used in real physical models such as 
modeling the Universe in cosmology). These examples include dust of non-interacting particles, ideal fluid 
and electromagnetic fields. We will discuss these examples in the questions of this section. 

In a spacetime region that is void of matter and energy, the energy-momentum tensor vanishes (i.e. 
Tyv = 0). As we will see in § 3.2, the Field Equation can be written as Ry, =k ine _ + 9uvT) and hence 


1125] The velocity 4-vector is given by U = y (c, ul,u?, u®) where 7¥ is the Lorentz factor, c is the characteristic speed of light 
and (u+,u?,u?) is the velocity 3-vector (see B4). 

[126] To be more consistent (regarding the physical dimensions) we may call it “momentum density flow” or “momentum flow 
density”. 
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the Field Equation in such case reduces to R,, = 0 and the solutions are called vacuum solutions.!!271 In 
this case only the Ricci curvature tensor is needed for the solution rather than the entire Einstein tensor. 
It should be noted that the condition R,,, = 0 does not mean that the spacetime is flat in the vacuum 
region because the condition for flatness is the vanishing of the Riemann-Christoffel curvature tensor (i.e. 
Ryvow = 0), and the condition R,,, = 0 is not sufficient for achieving flatness. This is a direct result of the 
fact that Ry» is a contraction of Ryyo and hence R,, does not contain the entire curvature information 
of the spacetime (see exercise 4 of § 2.12). This similarly applies to G,,, (noting that G,,, = 0 iff Ruy = 0; 
see exercises 7 and 9 of § 2.13). 


Problems 


1. Justify, in detail, the statements in the third paragraph of this section, i.e. the paragraph starting 
with: “To be more clear, let state the above description etc.”. 
Answer: We use in our justification the following statement (which is in the second paragraph): “So 
in brief, the component T” of the energy-momentum tensor represents the density (i.e. per volume) 
of the y*” component of the momentum 4-vector times the v*” component of the velocity 4-vector (but 
without the + factor).” 
Accordingly, the components of T#” are as follows (where we use loose symbolism in some cases): 
e T is the density (i.e. per volume V) of the 0” component of the momentum 4-vector (which is 
myc) times the 0°” component of the velocity 4-vector without the y factor (which is c), ice. 
poo _ mC aus myc? 2 E 
4 V V 
which is the energy density, as stated in the text. 
e T” is the density (ie. per V) of the 0" component of the momentum 4-vector (which is myc) times 
the i” component of the velocity 4-vector without the y factor (which is u’), that is: 


poi MIC i mycu' = myceu' = Eu’ _#- (dx' /dt) _ _ &/dt 
V V Ve Ve  (dx'dridz*)c  (dxidr*)c 


which is the temporal rate of energy flow per unit area perpendicular to the i*” direction divided by 
the speed of light c, as stated in the text. 
e T’” is the density (ie. per V) of the i“” component of the momentum 4-vector (which is myu") times 
the 0*” component of the velocity 4-vector without the 7 factor (which is c), that is: 

myut mycu’ — myctu’ — Eu' E (dz* /dt) E/dt 


i0 _ a = = 
ae Vor ae ae ~ Ve (dxtdzidzk)c (dxidzr*)c 


which is the temporal rate of energy flow per unit area perpendicular to the i*” direction divided by 
the speed of light c, as stated in the text. 
e T” (no summation over i) is the density (i.e. per V) of the i*” component of the momentum 4-vector 
(which is myu’) times the i*” component of the velocity 4-vector without the y factor (which is u‘), 
that is: _ 
my ja pu p (da! /dt) i p’ /dt 

V V dx'dxidx*  — daxidaxk 
which is the temporal rate of flow of the i*” component of momentum p’ per unit area perpendicular 
to the i” direction. 
e T” (i 4 j) is the density (i.e. per V) of the i*” component of the momentum 4-vector (which is 
myu') times the j‘” component of the velocity 4-vector without the y factor (which is wu’), that is: 


pi mr Pw _ pi (dei/dt) pide 
V 


V dx'dxidxk — datdx® 


[127] We refer the readers to exercise 13 of § 3.2 where the meaning and significance of local and global vanishing of the 
energy-momentum tensor is investigated and analyzed. Also see § 2.13 where it is shown that Gyr = 0 iff Ruv = 0, i.e. 
we have Guy = KT yy (according to the Field Equation) and since Tuv = 0 then Gyv = 0 and hence Ruy = 0 (according 
to exercise 7 of § 2.13). 
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which is the temporal rate of flow of the i*” component of momentum per unit area perpendicular to 


the j*” direction. 
eT!’ (i  j) is the density (ie. per V) of the j*" component of the momentum 4-vector (which is 
myu’) times the i” component of the velocity 4-vector without the y factor (which is u’), that is: 


myul e _ piul _ pi (de'/dt) _ pi /at 

V 4 dx'dxidxk = daxidaxk 
which is the temporal rate of flow of the j#” component of momentum per unit area perpendicular to 
the i*” direction. We note that we may also write: 


eS 


myul i myu' pen. pu p' (da /dt) _ pt/dt 
Vy SW  deidaida® ~ daxidak 
which is the temporal rate of flow of the i*” component of momentum per unit area perpendicular to 
the j'” direction. This (added to the above results regarding the components T®, T’° and T*’) should 
ensure the symmetry of the energy-momentum tensor. 
Note 1: we do not need to obtain the expression of T/* since it is contained in the expression of T” 
considering that i, 7 = 1,2,3 (and i 4 j since T” was considered separately). However, we did this for 
pedagogical purpose and to show the symmetry. 
Note 2: as we are supposed to use the energy-momentum tensor for general relativistic calculations, 
the use of the special relativistic formalism (like 4-momentum) in the above determination of the 
formalism of this tensor may be questioned. However, since the energy-momentum tensor is a function 
of coordinates and hence it is identified and determined point by point in the spacetime (where the 
spacetime at each point is supposedly flat by the locality condition) then the use of special relativity 
is justified (although this justification may also be questioned). 
2. Show that the energy-momentum tensor is symmetric. 
Answer: The answer of the previous question is sufficiently general to demonstrate the symmetry of 
the energy-momentum tensor in general. 
3. Find the energy-momentum tensor of a homogeneous cloud of dust made of identical non-interacting 
particles that move with a common velocity. 
Answer: This is one of the simplest examples of the energy-momentum tensor. Let the mass of 
each particle be m (since they are identical) and the common 3-velocity of these particles be u = 
(ut, u?,u®).051 Accordingly, the energy of each particle is ymc? and its 3-momentum is ymu where 
y is the Lorentz factor which is a function of u = |u|. Now, if the number density (i.e. the number of 
particles per unit volume) is N then we have: 
e The energy density of the cloud (which is represented by the component T°) is: 


= 


T® = Nyme? = pyc? 


where p is the mass density. 
e The temporal rate of energy flow per unit area perpendicular to the i*” direction divided by the 
speed of light c (which is represented by the components T° = T*° where i = 1, 2,3) is: 


(Nu' At) (ymce?) 


po = Ti _— 
(Atc) 


= Nymcu' = pycu' 


where (N u' At) is the number of particles crossing an area A (perpendicular to the i‘” direction) in 
time f, (ymc?) is the energy of each one of these crossing particles, and (Atc) represents “per unit 
area” (i.e. A), “temporal rate” (i.e. t) and “divided by the speed of light c’. 

e The temporal rate of flow of the i*” component of momentum per unit area perpendicular to the j*” 
direction (which is represented by the components TJ” where i, j = 1, 2,3) is: 


Nw At) (ymu') 


| 
Tas) 


= Nymu'u? = pyu'u? 


[128] We use m here as invariant, i.e. there is no difference between rest and non-rest mass. 
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where (N w At) is the number of particles crossing an area A (perpendicular to the j*” direction) in 
time f, (ymu’) is the i*” component of momentum of each one of these crossing particles, and (At) 
represents “per unit area” (i.e. A), and “temporal rate” (i.e. t). 
So, the energy-momentum tensor for this problem is given by: 


70. FOr. pee: pes Nymc?  Nymcut = Nymeu2 = Nymcu3 
(rT!) = De ae Ee Pe = Nymeut Nymutul Nymu'u2 Nymu'u 
- Nectar ed Bc Nymceu2 Nymu?ul Nymu?u2 Nymu?u 
Cel ee “Poe Nymcu3 Nymu3ul Nymu3u2 Nymuu3 

Coe eu? eu? ef ce ca ee 

= vm | oe He ve |_| oe Se ee ae 

cuy urw outue Uru cuy uru ute Uru 

cu® ueul uur urud cue u8ul uu? uu 


Note 1: we may also use the definition of the energy-momentum tensor (i.e. the flux of 4-momentum) 
and hence: 
e 


E E 2 
poo ( Le) XxC= vom — = Nmc’y = pyc? 
where (E/c) is the 0'" component of the momentum 4-vector, V is volume, c (which we multiply with) 
is the 0“ component of the velocity 4-vector (without y), and n is the number of particles inside 
volume V. 
e 


Poe tS ( fo xu = ou = Nmcyu' = pycu’ 
i i 
T° = 7 xC= — a Nmyu'c = pycu' 


where the steps are similarly justified. 
e 


es = 7 xu = a = Nmyu'u' = pyu'u' 
ij p j nmyulul ing i,j 
T = on aa ee = pyuw 
i p! i nmyul ut jt inj 
Te = yet yy Sime = pes 


where the steps are similarly justified. 
Note 2: it should be obvious that the above energy-momentum tensor can be written as: 


Ce cul cu? cu 
1 A LOS gas N N N N 
cul utut ulu? ultu m 
[T“”] = Nym =— plu =—_ UW P=:—_P? P= —_U"U 
cur u2ut utu? u?u 
v Y my Y 
cu® wut utu2 uiu3 


where P = my [c,u', u?, u?] is the momentum 4-vector of the individual particles, U = 7 [c, u', u?, u3] 
is their velocity 4-vector, and the superscript 7 symbolizes transposition. It is noteworthy that we are 
using here matrix notation. If we use symbolic notation of tensors (where T represents the energy- 
momentum tensor and UU represents the outer product of the velocity 4-vector by itself which may 


also be written as U @ U) and note that _ Nov Nom = po (where the subscript 0 refers to 


the proper quantity) then the above relation can be written as T = p97 UU which is the form that is 
commonly used in the literature. 
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4. Obtain the energy-momentum tensor for the problem in the previous question according to a co-moving 
frame.|!?9| 
Answer: The physical dimensions of the energy-momentum tensor is energy density (i.e. energy per 
unit volume). Hence, the components of this tensor can be generically expressed as mass density times 
speed squared (since mass times speed squared is energy like mc’). So, if we want to express this 
tensor in an invariant and simple form then we can use the proper mass density (which is invariant) 
and the 4-velocity of the cloud (which is also invariant), that: is:[18° 


ee = poU*U” 


where fo is the proper mass density (i.e. the density as seen from the co-moving frame) while U“ and 
U” represent components of the 4-velocity of the cloud. Now, in a co-moving frame u = 0 and y = 1 
and hence the velocity 4-vector becomes U = (c,0,0,0). Therefore, all the components of the energy- 
momentum tensor are zero except the energy density of the cloud (i.e. the component T°?) which 


is T°° = poU°U® = pocc = poc?. Therefore, the energy-momentum tensor according to a co-moving 
frame is: 
por? 0 0 0 
0 O 0 0 
apes | 
ee] 0 0 0 0 
0 0 0 0 
In fact, this can be obtained directly from the solution of the previous problem, i.e. 
ef ca ca* cui 
Le pelts 2 Di 1 eB 
py) cues att 
[T*"] = py ih? aap ait 4er3 
cue wu uu? ubud 


by setting p = po, y = 1 and u! = u? = u? = 0. So, the solutions of the two problems are consistent. 


In fact, the solution of this problem is a special case of the solution in the previous problem. 
Note: the easiest way to obtain the result of this question is to start from the general result that we 
obtained in the previous question (i.e. T = p9UU in symbolic notation which becomes T#” = ppUHU” 
in indicial notation). The result of this question can then be obtained immediately by noting that in 
a co-moving frame U = (c,0,0,0) as explained above. 

5. Find the energy-momentum tensor of ideal fluid in equilibrium considering the case of co-moving frame 
and assuming a locally flat spacetime.!!3!! 
Answer: Ideal (or perfect) fluid is characterized by having no internal friction forces (or viscosity) 
and hence it may be seen from this perspective as the continuum analog of the discrete model (i.e. 
cloud of non-interacting particles) of the previous two problems. However, ideal fluid is distinguished 
by having internal pressure due to its continuum nature. 
Now, if we use the argument and result of the previous problem then the components of the energy- 
momentum tensor of ideal fluid should have a term like the term in the previous problem (i.e. p9\U"U”) 
because the two systems are essentially identical (apart from being discrete and continuum models). 
Moreover, since ideal fluid has internal pressure (due to its continuum nature) then it should have two 
more terms: dynamic pressure term and static pressure term.!!°?! It can be shown that these terms 
are given by (p/ ) U#U” and —pg"” where p is the pressure (which is coordinate dependent) and g#” 


129] Co-moving frame is a frame that moves with the cloud and hence the particles of the cloud are stationary relative to 


the frame. Co-moving frame may also be called rest frame. 

In fact, this form was obtained in a more rigorous way in the previous question (see note 2 of the previous question). 
So, the purpose of this rather loose approach in this question is to obtain the result independently of the result of the 
previous question. Also see the upcoming note. 

Local flatness can be justified in general by the Riemannian nature of the spacetime. In fact, flatness may be justified 
even globally by weak gravity. 

We are assuming negligible thermodynamic effects (due to net directional bulk flow of heat energy) although thermostatic 
effects (due to random motion of molecules) should be included in the total energy (or mass-energy) of the system. In 
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is the metric tensor. Accordingly, the energy-momentum tensor of ideal fluid in a general frame is 
given by: 
Pp 
T™ = poU"U" + GUYUY — pg (98) 


We note that all the terms of this equation have the same physical dimensions (i.e. energy density) 
and hence it is dimensionally consistent. 

Now, in a co-moving frame u = 0 and y = 1; moreover if the spacetime is locally flat then the metric 
tensor can be approximated by the Minkowski metric (i.e. [g4”] ~ [n#”] = diag [+1,—1,—1,—1]) and 
hence the energy-momentum tensor in this case becomes: 


poe? 


0 

wy) 
r=] 5 
0 
Note 1: the justification of the form of [T””] as given in the last equation is that: due to the conditions 
u = 0 and [g#”] ~ [n#”] = diag [+1, —1, —1, —1], all the off-diagonal components will vanish, i.e. the 


first two terms of Eq. 98 will vanish by the condition u = O and the last term will vanish by the 
condition [g#”] ~ [7“”]. Moreover, we have: 


TO = poU® + UU — pg®® 
P 
= po (ye) (ye) + = (re) (Ye) —px 1 
por’ +p—p 
= por 


where line 3 is justified by y = 1 since u = |u| = 0. We note that the subscript 0 in po is a label (for 
proper) and not an index. 


Also: 
pe ss pu a UU — git 
= 0+0-p(-l) 
= p 


where i = 1,2,3 (with no summation over i) and line 2 is justified by U’ = 0 since u = 0. 

Note 2: referring to the aforementioned similarity between the ideal fluid system and the dust cloud 
system, we see that if we assume that the pressure of the ideal fluid is negligible (i.e. p ~ 0) then the 
energy-momentum tensor of the ideal fluid system will converge to the energy-momentum tensor of 
the dust cloud system of the previous problem. On the other hand, if we assume that the particles in 
the dust cloud system are interacting (by non-viscous forces simulating the pressure) then the energy- 
momentum tensor of the dust cloud system should become similar to the energy-momentum tensor of 
the ideal fluid system. 

Note 3: the physical dimensions of pressure are the same as the physical dimensions of energy density 
(i.e. kgm~1s~?) and hence the components of the above tensor are dimensionally consistent. 

Note 4: Eq. 98 represents one of the most general forms of the energy-momentum tensor in curved 
spacetimes and hence it is used in many applications of general relativity such as relativistic cosmology. 
In fact, it is possibly the most commonly used form in the applications of general relativity (due to the 
simplicity of ideal fluid as a model for gravity source as well as being relatively realistic in comparison 
to other simpler models). 


fact, some authors impose this in the definition of ideal fluid and hence ideal fluid is characterized by having no heat 
conduction as well as being inviscid. There are other definitions of ideal fluid (e.g. being isotropic in its rest frame or 
being totally characterized by density and pressure) some of which are equivalent or similar to others while others are 
questionable. However, most these definitions are incomplete or inaccurate since they focus on the properties of main 
concern in the given context. 
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6. 


Referring to the result that we obtained in problem 3 (which is a typical example of energy-momentum 
tensor), justify the generality of the definition of the energy-momentum tensor as “the flux of the 
yu’ component of the momentum 4-vector across a surface perpendicular to the v‘” coordinate” with 
regard to the temporal coordinate. 

Answer: For simplicity, consistency and clarity we use rather loose symbolism in the following answer 
(as we did in some previous answers). 

e For T™ the 0“ component of the momentum 4-vector is nyme and the 0"” coordinate is «° = ct and 


hence: , , 
nyme _myme _ onymen son oe 
At Act dx'dxi dx V 


e For T™ the 0°" component of the momentum 4-vector is nymc and the i‘” coordinate is x’ and hence: 


00 _ 


nyme nymec nymcdx* nm dx* 


= =e = ecu! 


To = = = = 
At (dxidx*)t  (datdxidx*)t Vet 


0 


e For T’° the i*” component of the momentum 4-vector is nymu’ and the 0” coordinate is 2° = ct 


and hence: 


pio nymu' nymeu' nymeu' nm; 
= — a = - — cu = CU 
At (dxjdx*) ct  dxidx*dx® aaa ae 
Exercises 
1. Why the energy-momentum tensor has 16 components with only 10 of these being independent? 


Pwr 


What are the physical dimensions of the energy-momentum tensor? Provide detailed explanation. 
Make a list of the main properties of the energy-momentum tensor. 

Show that the divergence of the energy-momentum tensor is zero (ie. T¥” = 0) assuming that the 
Field Equation is given as a postulate. 

What is the energy-momentum tensor of electromagnetic field? What is the significance of this? 
Show that the trace of the energy-momentum tensor of electromagnetic field is zero. 

What is the state of the Ricci curvature tensor R,, and the Ricci curvature scalar R in the vacuum 
problems!!331 of general relativity? What is the state of the curvature of spacetime in these problems? 
Comment on the result. 

Justify the premise that the vanishing divergence of the energy-momentum tensor (i.e. T#? = 0) is a 
requirement (or an implication) for the conservation of the momentum 4-vector. 

Show that the divergence of the energy-momentum tensor is zero, i.e. Ty = 0. 


. Justify the existence of vacuum problems and vacuum solutions in general relativity, i.e. why we should 


have curved spacetime at locations where T,,, = 0. 


. What is the significance of the fact that the divergence of the energy-momentum tensor is zero? Also, 


link this to the vanishing of the divergence of the Einstein tensor. 


. Make a comparison between the Einstein tensor G and the energy-momentum tensor T. 
. Discuss briefly the historical origin of the energy-momentum tensor and if it is a novelty of general 


relativity. 


. Comment on the following: 


Ri; = 0 (as well as R = 0) globally does not imply flatness (globally) from a geometric perspective but 
it implies flatness (globally) from a general relativistic perspective (according to the Field Equation 
and as required by special relativity). However, Rj; = 0 (as well as R = 0) locally does not imply 
flatness (locally) from a geometric perspective and it does not imply flatness (locally) from a general 
relativistic perspective. 


[133] 


“Vacuum problem” means gravitational problem related to a region of spacetime where there is no matter or energy. 
Accordingly, vacuum solutions are solutions of vacuum problems. 


Chapter 3 
Formalism of General Relativity 


The focus of this chapter is the formalism of general relativity which is represented by the Field Equa- 
tion. All the formal results of this theory (as a gravity theory and not as a “General Theory”) should 
be derived directly or indirectly from this equation possibly with the aid of some known physical and 
mathematical facts as well as certain assumptions and approximations. In the first section of this chapter 
we outline the background of the Field Equation and the rationale that is followed in developing this 
equation and applying it in general. This will provide a prior insight into the physical and mathematical 
framework of general relativity. This will also help to understand and appreciate this equation and its 
formal consequences which are highly abstract and difficult to comprehend. The subsequent sections will 
then investigate the Field Equation in its full form (with and without the cosmological constant) and in 
its linearized form. The chapter will be culminated by a brief investigation of the two main aspects of 
general relativity and its formalism (i.e. as a gravity theory and as a “General Theory”). 

We should remark that in general relativity we actually have two formalisms (which are independent 
of each other): a general-relativity-specific formalism represented by the Field Equation (and its direct 
consequences) and a special-relativity-specific formalism represented by the Lorentz transformations (and 
their direct consequences). In fact, the former formalism represents the gravitational component of the 
theory of general relativity while the latter formalism represents the local application of Lorentz mechanics 
in the spacetime of general relativity (where this spacetime is described and formalized by the former 
formalism) due to the local flatness of the spacetime according to general relativity. In other words, the 
former formalism creates the spacetime (whose structure incorporates gravity and its effects) while the 
latter formalism sets the physical rules that should be followed locally in this spacetime.|'#4l So, our 
investigation in the present chapter is about the general-relativity-specific formalism since the special- 
relativity-specific formalism belongs to the theory of special relativity and its literature and textbooks 
(see for example B4). 

In this context, it is important to appreciate the above-indicated fact that on one side general relativity 
is a gravity theory that is based on the Field Equation, and on another side it is a “General Theory” 
that is based on generalizations of the application and invariance of physical laws in various types of 
reference frame and the nature of this application and invariance where the principles of general relativity 
(i.e. the principles of equivalence and invariance associated with the claim of local application of special 
relativity in the spacetime of general relativity) play the major role in the latter side. [135] Accordingly, the 
main focus of our investigation in the present chapter is the gravity theory and not the “General Theory” 
although the latter aspect of the theory will be briefly investigated in the final section, as indicated above. 


3.1 Rationale of the Field Equation 


In the following points we outline the rationale that is followed (within the theoretical framework of 
general relativity) in developing and applying the Field Equation:!'*° 

e We start from the principle of invariance which general relativity rests upon. According to this principle 
the laws of physics in general should be independent of the employed coordinate systems and hence if a 


[134] Although these rules essentially come from special relativity (due to the local flatness of spacetime) they could be 
modified (due to their dependency on the frames and the transformations between them). In fact, this modification 
could be seen as a link between the two formalisms (and even as a link between the gravity theory and the “General 
Theory”). 

1135] Th fact, other principles (see § 1.8.4) may also play a role. 

1136] Tt is worth noting that in the development of the Field Equation (according to the above rationale) elements and 
principles that mainly belong to the “General Theory” may be employed although the Field Equation in itself represents 
the gravity theory. This can also be seen as an example of the links between the two sides of the theory. 
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physical law is valid in one frame it must be valid in all other frames. So, the privileged frames of special 
relativity (ie. inertial frames) are discarded and replaced by a more general type of frames (to include 
non-inertial frames), and consequently the Minkowskian spacetime and its flat metric are replaced by the 
more general Riemannian space and metric.!!871 Accordingly, the spacetime interval do, which is given in 
special relativity by (do)? = Nu dx" dx”, will be given by: 


(do)? = guyda"dx” (99) 


where g,,, is the covariant metric tensor of the spacetime (which is curved in general) while dx and dx’ 
are differentials of general coordinates. 

e The metric g,, of the spacetime should be determined by the distribution of matter and energy in 
the spacetime, and this can be seen as an implication of the principle of metric gravity (see § 1.8.4).138] 
Accordingly, the geometry of the “mathematical” spacetime (as represented by G,, which is ultimately 
determined by g,,,) should be linked to the “physical” spacetime (as represented by T,,,). 

e The distribution of matter and energy is defined by the energy-momentum tensor T”” which in the 
spacetime of special relativity and with the absence of forces satisfies the relation T“? = 0 which originates 
from the conservation principles of energy and momentum and expresses the divergence-free nature of this 
tensor.!!89] Now, in the spacetime of general relativity we are dealing with gravitational effects only (i.e. 
excluding any other force) and the gravity is accounted for by the space metric (i.e. it is not a force). 
Hence, by the principle of invariance the energy-momentum tensor JT” should also be divergence-free 
(as in the spacetime of special relativity) and hence it should satisfy the relation T4” = 0 (which is the 
curved-space version of the relation 7“ = 0 which is the flat-space version). 

e The divergence-free Einstein tensor G suggests a proportionality with the energy-momentum tensor 
T which is also divergence-free (as seen in the previous point). This should lead to the Field Equation 
G = «T where « is a proportionality constant (see § 3.2; we should also refer to § 3.3 where the cosmological 
term may be seen as generalization from proportionality to linearity). 

e The available physical setting of the problem (plus appropriate simplifications and assumptions) should 
lead to the determination of T. Employing the Field Equation with the use of the determined T should 
lead to the determination of G and hence the determination of the metric tensor which depicts the 
geometry of the spacetime. 

e The determination of the metric tensor will lead to the determination of the geodesic trajectories (or 
world lines) in the spacetime as well as other required physical consequences and results that depend 
on the metric, and hence the problem of gravitation as a tempo-spatial phenomenon is solved (with the 
extraction of all its natural conclusions). 


Problems 


1. Outline the theoretical approach which general relativity (as represented by the Field Equation) follows 
in addressing the kinematics of gravity. 
Answer: The theoretical approach may be outlined as follows: 
e Matter and energy in the spacetime (represented by the energy-momentum tensor) shape the geom- 
etry of the spacetime. 
e The geometry of the spacetime (represented by the metric tensor which is embedded in the Einstein 
tensor) determines the geodesic and null geodesic paths which are the trajectories (or world lines) of 
massive and massless objects in the spacetime (noting that these objects should be free because these 
trajectories presumably represent the effect of gravity alone). 


1137] This should imply discarding the privileged Lorentz transformations and replacing them by more general coordinate (or 
frame) transformations. We should note that the generalization to Riemannian space and metric requires more than 
the invariance principle and the independence of laws from the employed coordinate systems (because this relates to 
the nature of the coordinated space and not to the nature of the coordinate system). In fact, the equivalence principle 
(with its locality condition) or some other principle (see § 1.8.4) may be used to justify this generalization. 

[138] In fact, some may consider this as an implication of the principle of equivalence. 

1139] As noted earlier (see § 2.14), the energy-momentum tensor can be defined in the spacetime of special relativity but not 
as a source of gravity. 
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e The kinematical effects of gravitation are then described as free fall motion of massive and massless 
objects along these geodesic and null geodesic paths. 


Exercises 


1. What we mean by the “Rationale of the Field Equation” which is the title of this section? 

2. Outline the transition from classical gravity to general relativistic gravity. 

3. In the literature of general relativity we can find several attempts and methods for deriving the Field 
Equation from more fundamental principles in a rigorous way. Discuss this issue. 


3.2. The Field Equation of General Relativity 


As seen in § 2.13 and § 2.14, the Einstein tensor represents the geometric component of gravity while the 
energy-momentum tensor represents its physical component. We have also seen that the mathematical 
properties of these two tensors are similar, e.g. both are rank-2 symmetric tensors in 4D spacetime. So, 
it is logical that if we have to correlate the physics of gravity to the geometry of spacetime (as demanded 
by the rationale of general relativity) in an invariant form then we should find a formal tensorial relation 
between these two tensors. We have also noted that the divergence of both these tensors is zero, and 
hence it is logical to assume that these two tensors are proportional, that is: 


G=nT (100) 


where G is the Einstein tensor, « is a constant scalar (i.e. proportionality factor) and T is the energy- 
momentum tensor. Using the definition of the Einstein tensor and employing the indicial notation of 
tensors, the last equation can be written in covariant form as: 


Gu = KD (101) 
1 
Ruv - 59k = Kl (102) 
1 87G 
Ruy = a Iu kh = a ty (103) 


where line 2 is based on the definition of Einstein tensor (see Eq. 94) while line 3 is based on the 
expression of « which is determined by the correspondence principle (i.e. convergence of general relativity 
to Newtonian gravity in the classical limit) as will be justified later (see § 5.1). It is important to note 
that G here is the gravitational constant and not the trace of Einstein tensor. We should also note that 
depending on the convention about the tensors in the Field Equation and the sign of «, a minus sign may 
be added to the right hand side, i.e. Gy = —KT yp. 

As we will see, the main objective of formulating and solving the Field Equation in any particular physical 
situation is to find the metric of the spacetime which represents the geometric properties of the spacetime 
and hence it contains all the required information about gravitation according to the general relativistic 
doctrine. However, different strategies are followed in formulating and solving the Field Equation where 
these strategies mainly depend on the physical and mathematical considerations of the concerned problem 
as well as practical and procedural factors. 


Problems 


1. Compare the essence of the Newtonian gravity as a field theory represented by the equation V - g = 
—4nGop (see § 1.5) to the essence of the gravity of general relativity represented by the Field Equation. 
Answer: The Newtonian formulation correlates the gravitational field g to the mass density p as a 
source for the field. Similarly, the general relativistic formulation correlates the “gravitational field” 
represented by the spacetime geometry (as embedded in G and ultimately in the metric tensor) to 
the mass-energy source (as embedded in T). Hence, the two theories (as represented by their field 
formulations) are similar in essence although this similarity is not seen when we compare the Newtonian 
primitive formulation (represented by the form f = Guest | to the field formulation of general relativity. 
An important difference between the two formulations is that the Newtonian formulation is purely 
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spatial (in 3D Euclidean space) while the general relativistic formulation is tempo-spatial (in 4D 
Riemannian spacetime). Also, the source of gravitation in the Newtonian formulation is mass while 
the source of gravitation in the general relativistic formulation is mass-energy. We should also note 
that the classical formulation is scalar and hence it has a single component while the general relativistic 
formulation is tensorial and hence it has 16 components (10 of which are independent). 

What are the physical dimensions of K? 

Answer: We have |«&| = ne where the minus sign is dropped in evaluating the physical dimensions. 
Now, 87 is dimensionless, G (which is the gravitational constant) has the dimensions of force (which 
is mass times acceleration) times area per mass squared (e.g. Nm?kg~* = kg~'m*s~?) while c* has 
the dimensions of length* divided by time* (e.g. m+s~*). Hence, « has the dimensions of time squared 
per length per mass (e.g. kg~* m7! s?). 

Verify that the Field Equation is dimensionally consistent and comment on the result. 

Answer: We have G = kT. Now, if we use SI units to represent the physical dimensions then G has 
the dimensions of m~? (see § 2.13) while «T has the dimensions of kg~! m7! s? x kg m7! s~? = m7? 
(see the previous problem and § 2.14). Hence, both sides of the Field Equation have the dimensions 
of m~? and therefore the Field Equation is dimensionally consistent. 

Comment: the fact that the Field Equation has the physical dimensions of reciprocal area (e.g. m~”) 
which are the same as the physical dimensions of curvature reveals the nature of the Field Equation 
as a curvature equation. In fact, this also reveals the nature of general relativity as a geometric theory 
about the curvature of spacetime as a model for physical gravity. 

In the 4D spacetime of general relativity the Riemann-Christoffel curvature tensor Ry has 20 inde- 
pendent components while there are only 10 Field Equations. What are the implications of this? 
Answer: We can claim that there are two main implications: 

e It is possible to have vacuum solutions (i.e. curved spacetime and gravitational field in empty regions) 
since in this case we have only 10 vacuum Field Equations R,,, = 0 (since Ry, has only 10 independent 
components) while we have twice this number of R,,,. components. In other words, the vanishing 
of the 10 components (or constraints) on R,, (ie. Ryv = 0) is consistent with the non-vanishing of 
the 20 components (or constraints) on Rvp. (noting that the non-vanishing of R,,,,. implies curved 
spacetime and the existence of gravitational fields since a space is flat iff Ryvpw = 0). 

e It is possible that some of the spacetime curvature information are not included in the general rela- 
tivistic gravitational formulation since 10 Field Equations are not sufficient in general to incorporate 
the full curvature information contained in R,,,,,. In other words, gravity in general relativity is 
correlated to only part of the curvature of spacetime and not to the entire curvature of spacetime. 


Exercises 


Li; 


so 


11. 


How many component equations the tensorial Field Equation represents, and how many of these 
component equations are independent? 

What “solving the tensorial Field Equation” means mathematically and physically? 

What is the classical equation of gravity that corresponds to the tensorial Field Equation of general 
relativity? How many component equations the classical equation of gravity represents? 

Show that G = «KT and R = —«T where G,T, R are the traces of G4, TY, Re. 

Show that Rup =k (Cai _ $IuvT). 

Show that if a region of spacetime is void of matter and energy then the Field Equation in that region 
becomes: Ry, = 0. What is the significance of this? 

Does the condition R,,, = 0 at a given region of spacetime means that the given region is flat? Justify 
your answer. 

Show that G,, =0 iff T,, =0 (where these conditions apply locally or globally). 

Show that R,, =0 iff T,,, = 0 (where these conditions apply locally or globally). 


. Compare the relations that we derived in § 2.13 (e.g. G = —R) with the relations that we obtained in 


this section, e.g. Ry, =k (Dia — 591). 
Can you see a potential problem in the Field Equation from the perspective of the fact that the Einstein 
tensor contains less curvature information than the Riemann-Christoffel curvature tensor? 
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12. Discuss the principle of geodesic motion in general relativity and its relation to the Field Equation. 

13. Analyze the significance of the vanishing of the energy-momentum tensor locally and globally within 
the context of the Field Equation. 

14. Consider obtaining the Field Equation as a generalization of the classical gravity with the help of 
special relativity and some general principles (e.g. principle of invariance, principle of equivalence, 
principle of metric gravity, etc.). 


3.3. The Field Equation with Cosmological Constant 


A cosmological term may be added to the Field Equation and hence it becomes: 
1 
Guy + Aguy = KTyy or Ruv - (SR - A) Suv = KD (104) 


where A is the cosmological constant. The cosmological term was introduced by Einstein using cosmologi- 
cal arguments to make the Universe static, i.e. not expanding or shrinking.!'4°1 However, the cosmological 
term is still in use in modern cosmology (although for other purposes) even with non-static cosmological 
models. The cosmological term may be interpreted as representing the contribution of the so-called “dark 
energy” and hence the cosmological term may be considered as part of the right hand side, i.e. part of 
the gravity source as represented initially by the energy-momentum tensor.!'4!] Accordingly, the Field 
Equation will be written as: 


Ryuv — sow = KT yy — AgGuy or Ruv — sow ht (Te -_ ihe (105) 
In fact, the cosmological term may be absorbed in the «7j,, term and hence the term KT;,, is understood 
to incorporate the effect of the cosmological term, i.e. T;,, includes the effects of the —Zg, term in the 
last equation. However, it should be noted that in this case the cosmological term is still independent 
of other sources of gravitation and hence the vanishing of mass and energy in the spacetime does not 
imply the vanishing of cosmological term, i.e. T,, could stand for the energy density represented by the 
cosmological term alone. 

We should remark that dark energy (which is supposedly represented by the cosmological term) may 
also be labeled as “vacuum energy” although vacuum energy may be seen as only one possible form or 
source of dark energy. The use of “vacuum energy” to refer to dark energy originates from the premise that 
the source of dark energy (or one of its sources) is the alleged “quantum mechanical vacuum fluctuations”. 
In fact, the disastrous failure of the quantum mechanical calculations to provide a sensible quantitative 
estimate of this alleged vacuum energy should discredit (at least for the time being) any attempt to explain 
the alleged dark energy by the alleged vacuum fluctuations. There may also be another reason for the 
“vacuum” label that is it is associated with the ground state of the Universe, i.e. its vacuum state. We 
should also remark that because of its cosmological origin and significance (as its name suggests) and 
because we have no primary interest in cosmology in the present book, we do not go through the details of 
the cosmological term and constant. The interested reader should therefore consult the relevant literature 
of relativistic cosmology. 

Finally, the reader should note that the use of the Field Equation with the cosmological constant 
term (i.e. Eq. 105) should be restricted to the cosmological problems and hence in all non-cosmological 
problems the original form of the Field Equation (i.e. Eq. 101) should be used. This is because the 
contribution of the cosmological term in non-cosmological problems is completely negligible due to the 
extreme smallness of the cosmological constant (assuming it is not zero). This should also be justified 
from a purely conceptual and theoretical perspective as can be easily inferred from the “cosmological” 
label that is attached to this term and this constant. 


[140] 74 appeared later that this proposed universe is unstable (i.e. it deviates from its static state by any perturbation and 
hence it becomes expanding or shrinking). 

141] Tn fact, this move from the left hand side to the right hand side may be seen as converting the cosmological term from 
being a geometric factor to be a physical factor. 
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Problems 


1. What are the physical dimensions of the cosmological constant? 
Answer: Due to the requirement of dimensional consistency, all the terms of physical equations should 
have the same physical dimensions. Now, if we remember that g,,, is dimensionless (see § 2.5) and the 
dimensions of the other terms of the Field Equation are reciprocal area (see § 3.2) then from Eq. 104 
we can conclude that the physical dimensions of the cosmological constant are reciprocal area (and 
hence it essentially represents curvature). 

2. Verify that the dark energy density term (i.e. —A Juv) has the physical dimensions of energy density 
(like Ty.). 
Answer: g,, is dimensionless, A has the dimensions of reciprocal area (e.g. m~”) according to the pre- 
vious problem, and « has the dimensions of time squared per length per mass (e.g. kg! m~!s?) accord- 
ing to problem 2 of § 3.2. Hence, the term —44,,,, has the physical dimensions of m~?/ (kg~' m7!s?) = 
kgm—!s~? = J/m?, i.e. energy density. 

3. Comment on the physical and epistemological significance of the cosmological term. 
Answer: The cosmological term represents the physical effects of the so-called dark energy (or vacuum 
energy where vacuum energy may be seen as an instance or potential source of dark energy). In our 
view, dark energy is a metaphysical concept that was introduced into modern physics to address certain 
limitations in its theoretical structure. In fact, it is just one example of the numerous metaphysical 
objects and concepts that infest modern physics and cosmology, and the main credit for this belongs to 
general relativity. These metaphysical entities usually originate from highly hypothetical considerations 
and contemplations that are totally detached from physical reality and hence they are not legitimate 
in physical science although they may be legitimate in other disciplines (such as religion or philosophy 
or pure mathematics). 

Exercises 


1. Using the Field Equation with the cosmological term, show that G = «T — 4A and R= —«T + 4A. 

2. Show that Ryp = (ay _ $v) + Ag, and comment on its significance. 

3. What is the relation between the cosmological term and dark energy? 

4. Discuss dark energy and assess it as a physical or metaphysical concept. Also, discuss and assess dark 

matter from the same perspective. 

Outline the characteristics of the cosmological term Agyy. 

6. Outline the characteristics of the cosmological constant A. 

7. Discuss briefly some nonsensical consequences and implications that are commonly associated with the 
cosmological term and constant. 

8. Discuss the implication of the presumed existence of dark energy on special relativity. 


as 


3.4 The Linearized Field Equation 


In this section we briefly investigate the linearized form of the Field Equation which is used as a good 
approximation to the full (non-linear) Field Equation in the case of weak gravitational fields where the 
spacetime is pseudo- or quasi-Minkowskian and hence it deviates slightly from the flat geometry of the 
authentic Minkowskian spacetime.!!42]_ The importance of the linearized form is not only because of its 
status as a practically-useful approximation (and hence it can save valuable effort when high accuracy is 
not required) but it is also because of its use in some applications of general relativity (e.g. gravitational 
waves) where formulations based on the full Field Equation are prohibitive or unavailable. In this regard, 
the linearized form serves like numerical methods and classical gravity in providing reasonable solutions. 
In fact, the linearized form can also be used in some theoretical arguments and contentions and hence it 
is useful to have regardless of any other reason. 

In the following points we outline the derivation of the linearized form of the Field Equation. However, 
we should remark that we omitted some detailed technicalities and justifications (as well as background 


42] Th fact, gravity in the weak field regime can be seen as a minor curvature in spacetime and as a Lorentzian field in a 
flat Minkowskian spacetime. From the latter perspective, it is like electromagnetism in special relativity. 
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materials) to keep inline with the intended size, style and level of the book. In fact, this section is mainly 
included for the purpose of completeness and hence the reader can skip it with no loss of continuity 
(although some of the contents in this section are used casually as background materials later in the book; 
see for example § 5.1). 

e We note first that all the approximations in the derivation of the linearized form are to the first order 
and hence any non-linear function will be linearized by discarding higher order dependencies. In fact, 
this is the essence of the linearization process which the linearized form is based on. However, we should 
remark that the linearization is solely related to the metric and the geometric entities that are based on 
the metric, i.e. it is related to the geometry of the spacetime exclusively. 

e As indicated above, the spacetime in the linearized form is quasi-Minkowskian and hence in a first order 
approximation the covariant metric tensor g,,, can be expressed as: 


Juv = Nuv =F Ev (106) 


where 7,, is the covariant Minkowski metric tensor and ¢,, is a perturbation tensor representing first 
order deviation from flatness such that |e,,,| << 1 and where for the sake of simplicity we used (and will 
continue to use) exact equality to represent approximate equality to first order.!’4%! The symmetry of Ou 
and ,, implies the symmetry of €,,, (see § 2.2). 

e By a similar argument to the argument in the previous point we can conclude that in a first order 
approximation the contravariant metric tensor g“” can be expressed as: 


SOOT (107) 
where 7” is the contravariant Minkowski metric tensor and y“” is a perturbation tensor representing first 
order deviation from flatness such that |x“”| < 1. It can be shown (see exercise 2) that x4” = —e"” and 
hence: 

Cai, tae (108) 


The symmetry of g"” and 7” implies the symmetry of e“” (see § 2.2). 

e For the purpose of lowering and raising indices of minute tensorial quantities we can use n,, and nh” 
as index lowering and index raising operators instead of g,, and g’”. The justification is that in the first 
order approximation the product of ¢,,, and e¥” with the minute tensorial quantity should be discarded 
because it is of higher order. For example, if A” is a minute tensorial quantity then we have: 


Guv AY = (hie Feu) A’ = pA” HegA’ = nywA* =A, (109) 


where in the third equality we discarded ¢€,,,A” because it is of higher order. Similarly, if B, is a minute 
tensorial quantity then we have: 


gt” By = (nt” — #”) B, =n!” B, — eB, = 7” By = BY (110) 


where in the third equality we discarded <“” B, because it is of higher order. Accordingly, for the purpose 
of index lowering and index raising of minute tensorial quantities we have: 
Suv = Nw and gt” = 1” (for lowering and raising only) (111) 


e The Field Equation of general relativity is given by: 


1 
Ru Pn 5 Iu = KD (112) 


As indicated above, the linearization process is related to the metric and its companions. In other words, 
to obtain the linearized Field Equation we need to find the linearized forms of the geometric quantities in 


1143] Tn fact, €yv may be considered as Lorentzian tensor but not general tensor. We should also note that we need to assume 
the smallness of the derivatives of the components of ¢,, as well (and hence their products are negligible according to 
the first order approximation) although this may be understood from our previous explanation. 
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the Field Equation (which are R,,, and R, noting that g,,, is already linearized) and insert these linearized 
forms into Eq. 112. 

e To obtain the linearized form of R,, and R we need first to obtain the linearized form of the Christoffel 
symbols, that is: 


: g’* 
Div = Ps (Gv dpa + OnGva — OaGpv) 
— - (QvE pea T OnEva > OnE wv) 
= 7 (OvE pa + OnEva = Oo€ pw) 
1 
= 1 (ey! + dues? — Oe) (113) 


where line 2 is because 7, is constant, in line 3 we use Eq. 111, and in line 4 we raise the index a by 
nv@[1441 We note that 0” is the contravariant partial differential operator. 
e Ry is given by (see Eq. 89): 


= a a B a B a 
Ruy = O02, — 0.0%, $1813, —T37S, (114) 


Now, the last two terms are second order in e’s (or their derivatives) and hence the linearized R,, is given 
by: 


Ray SOT O,r8; 


ag (Oe 0,027 = 000°) = ; (On0e? 4Os0je2 = 030% 0) 


(QOuee — WOE pa — Oa pee + Oo0E pr) 


WOVE — OVO" Epa — OvOpEy + * Eien) 


Op Ope — OvO*E pa — OnOpep + Ae) 


NLR MIR w]e wel] e bw 


i i 


DOE — OyOae SX — Oa0Ones +O Epv) (115) 


where in line 2 we use Eq. 113, in line 3 we use the commutativity of the partial differential operators, 
in line 4 we use the definition of the d’Alembertian operator (i.e. 0? = 0.0%; see B4 and exercise 15 
of § 2.2), and in line 5 ¢ represents the trace of éy- The second term in the last line is justified by the 
following: 


OW Ena = (n°? dg) Ena = Op (eis) = 0 (e;) = OaE (116) 


where the constancy of 7%? allows it to pass the partial differential operator (i.e. they commute). 
e We obtain the linearized form of R by contracting the linearized form of R,, (after raising its first 
index), that is: 


Re 
= gh Ru 
— ne” Rup 


pV 
= Se (Oye — O,Aae oe — Bye + Wey) 


[144] The step in line 3 may seem unnecessary. However, it should be needed for instance for justifying the commutativity of 
the metric with the partial differential operators (see for example Eq. 116). Moreover, it is more appropriate considering 
that eyv is essentially a perturbation tensor on a Minkowskian spacetime background. 
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1 
= 5 ( 26 — O,0,6"% — OnOper™ + Ze) 
= ; (Oe — d,0,E"* — Oy0,e"* + 07) 
= Oe-0,0,e%* 
= Oe —0,0ge%* (117) 


where in line 3 we use Eq. 111, in line 4 we substitute from Eq. 115, in line 5 we use 7” as an index 
raising operator (noting that n“”0,0,¢ = 0,0”e = Oe), and in line 6 we relabel the dummy indices in the 


second term (noting the symmetry of e”° or the commutativity of the partial differential operators).!145) 
e On substituting from Eqs. 115, 106 and 117 into Eq. 112 we get: 
1 ik 
5 (OOpe — WOo€ pf — OvOpey + eau) 3 (Nuv + Ep) ( Fea DaOge"*) = RE, (118) 
(Q,Ou€ — Ov0aeES — One S + Oey) — Nv (Be -— Aadge"*) = WT yy (119) 
"Eny + Ope — OOaE eS — Oa.OnES — Nv (O?e — Oadge**) = WT yw (120) 
where in line 2 we discard the higher order terms, i.e. E,, ( 2o— In.ge°). The last equation is the 


linearized form of the Field Equation. 

Exercises 

1. The linearization process is solely related to the metric and the geometric entities that are based on 
the metric. Why? 

2. Show that g#” = nt’ — eh”, 

Assess the reliability of the linearized form of the Field Equation. 

4. What is the significance of linearizing the Field Equation? 


oe 


3.5 General Relativity as Gravity Theory and as General Theory 


As indicated already repeatedly, general relativity is a “General Theory” as well as a “Gravity Theory”. 
In the following points we discuss this issue and highlight its main implications and significance: 

e The formalism of the “Gravity Theory” is represented by the Field Equation and its consequences, while 
the formalism of the “General Theory” is represented by the combined formalism of the “Gravity Theory” 
plus the formalism of special relativity. Accordingly, the “General Theory” consists of two independent 
formalisms: gravitational formalism and special relativistic formalism. The role of the gravitational 
formalism is to set the spacetime and determine its geometric properties and consequences (i.e. metric, 
geodesics, etc.), while the role of the special relativistic formalism is to set the physical rules that should 
take place locally in the spacetime. 

e The fundamental principle which the “Gravity Theory” is based on is the weak equivalence principle 
(plus the principle of metric gravity which claims that gravity is a geometric attribute of the spacetime 
whose geometry is determined by the presence of matter), while the fundamental principle which the 
“General Theory” is based on is the strong equivalence principle. 

e The independence of the gravitational and special relativistic formalisms of each other makes the validity 
and invalidity of these formalisms (as well as the evidence for and against them) independent, i.e. the 
validity or invalidity of one of these formalisms does not imply the validity or invalidity of the other 
formalism. 

e Although there is some evidence for the validity of the “Gravity Theory’, there is no evidence for the 
validity of the “General Theory” (in fact, there could be evidence against the “General Theory” since the 
strong equivalence principle is not valid in general). This is inline with the previous point since the validity 
of the “Gravity Theory” does not imply the validity of the “General Theory”. 


Exercises 


[145] Again, the step in line 3 may seem unnecessary (but it is needed as before). 
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1. Outline the essence of the theory of general relativity as a “General Theory” that has gravitational and 
non-gravitational formalisms. 

2. Give examples of the features of general relativity that characterize its nature as a “General Theory”. 

How to generalize a special relativistic law to become a general relativistic law? 

4. Analyze and assess the procedure proposed in the previous exercise. 


oe 


Chapter 4 
Solutions of the Field Equation 


The tensorial Field Equation consists of terms of rank-2 symmetric tensors in 4D spacetime. Each one 
of these terms has 10 independent components and hence the tensorial Field Equation represents 10 
independent equations where each one of these equations is a non-linear partial differential equation. 
Accordingly, the solution of the Field Equation means solving 10 coupled non-linear partial differential 
equations. So, it is no surprise that the Field Equation is notorious for its complexity and difficulty 
to solve especially analytically. Hence, there are very few analytical solutions to the Field Equation 
related to simple cases, and therefore most solutions (if they exist at all) of general relativity problems 
are obtained computationally with the aid of numerical methods and computers. Approximate solutions 
may also be obtained from other inexact methods such as linearization (see § 3.4) and following semi- 
classical approach (where part of the problem is formulated classically). In fact, even the exact analytical 
solutions are usually based on a number of approximations and compromising assumptions (at least in 
their application). 

The main task in solving general relativistic problems is to find the appropriate metric tensor that cor- 
responds to the particular energy-momentum tensor that represents the physical situation of the problem. 
Ideally, we start from a certain physical setting (mass, energy, momentum, etc.) and we are required to 
find the spacetime geometry that corresponds to that physical setting.!'46] On obtaining the geometry 
of the spacetime, as represented by the metric tensor, the geodesic paths (which describe the trajectories 
of massive and massless free objects in spacetime) are then obtained from the geodesic equation (see § 
2.9.5). Apart from obtaining the geodesic paths, many other consequences and results can also be ob- 
tained directly or indirectly from the derived metric, as we will see later (refer for example to § 6, § 7 and 
§ 8). 

In the next sections we investigate a number of analytical solutions to the Field Equation. In fact, 
these represent a sample of the metrics that can be obtained from solving the Field Equation in particular 
physical situations. The main purpose of presenting these solutions is to have a feeling of the theory and 
its formulation and application. Therefore, the description of these solutions is very brief except the most 
important one of these which is the Schwarzschild solution where we provide derivation and rather detailed 
investigation because of its simplicity and versatile applicability as well as being a typical case study for 
representing and featuring the solutions of general relativistic problems. The Schwarzschild solution will 
also be used for developing other formal aspects of the theory that depend on the metric of the spacetime 
(see for example § 5.2, § 6.3, § 6.4, § 8.1 and § 8.2). 

We should remark that being a mathematically valid solution to the Field Equation does not imply being 
a physically valid solution. In other words, being a valid mathematical solution is a necessary but not 
sufficient condition for being a valid physical solution.!'47! Accordingly, we may obtain a mathematically 
legitimate solution which is physically nonsensical or unrealistic. Therefore, any solution that is obtained 
from solving the Field Equation should be checked meticulously for physical validity and sensibility. In fact, 
the checks for physical validity and sensibility are more required in general relativity than in most other 
theories due (among other reasons) to its exceptional mathematical and theoretical sophistication (which 
makes mathematical artifacts and theoretical fantasies more common) and the use of curved spacetime 
paradigm (which introduces further complications such as the duality of metrical and coordinate quantities 


[146] As we will see in the method that we follow in finding the solution of the Schwarzschild problem (see § 4.1.1) the 
above-described approach is a rather ideal recipe for solving the Field Equation and hence it is not followed in most 
cases. We should also refer the reader to § 5.1 for another simple (one component) application of the Field Equation 
that represents a different solution approach and strategy which can help the reader to understand and appreciate how 
the solutions of general relativistic problems are obtained. 

147] Th fact, this is true in general although it should be more obvious in general relativity. 
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as we will see in § 6). We should also note that all the methods and strategies for solving the Field Equation 
(some of which will be outlined in the questions) have serious limitations, breakdowns and traps and this 
should put more emphasis on the need for verifications and tests for physical sensibility and validity (apart 
from the required ultimate verification and validation by experimental and observational evidence). 


Problems 


1. Why is it very difficult to obtain a solution for the Field Equations? 
Answer: There are several reasons such as: 
e The Field Equations are highly non-linear and hence exact analytical solutions are generally unavail- 
able. Also, the principle of superposition cannot be used while the perturbation techniques become 
questionable. 
e The tensorial Field Equation represents 10 independent equations. Therefore, obtaining a solution 
means solving a system of 10 simultaneous second order partial differential equations which is very 
difficult (if not impossible). 
e The Field Equations are tensor equations in 4D spacetime and hence they involve many variables, 
components, indices, etc. and that makes them very complicated and confusing. Also, tensor calculus 
notations and techniques in such a space are highly sophisticated and messy. 
e In realistic physical situations it is difficult to formulate the physical ingredient of the Field Equa- 
tions which is the energy-momentum tensor. Hence, a number of approximations, compromises and 
sophisticated tackling strategies are usually required to make the problem solvable. 

2. Explain why the Field Equations of general relativity are non-linear. 
Answer: Because the Einstein tensor is a non-linear function of the metric (refer to § 2.13). Non- 
linearity may also affect the energy-momentum tensor. 


Exercises 


1. We may describe a solution of a scientific problem as “ideal solution”. Discuss this issue and indicate 
the approximation methods that are generally used in science to obtain second best solutions. 

2. Outline some of the approximation methods that are used to overcome the technical difficulties in 

solving general relativistic problems. 

What “solving the Field Equations” means? 

4. Outline the steps that should be followed in formulating and solving gravity problems according to 
general relativity. 

5. Outline some alternative tackling strategies that are used in formulating and solving general relativistic 
problems. 
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4.1 Schwarzschild Solution 


The Schwarzschild solution (or metric) is one of the simplest analytical solutions to the Field Equations 
and it is the first solution to these equations from a historical perspective. The solution is used to describe 
the geometry of spacetime around various types of astronomical objects such as stars and planets and 
hence it provides a basis for the solution of many astronomical problems. It also provides a basis for 
many predictions and results related for example to the gravitational time dilation (see § 6.3.3) or the 
gravitational light bending (see § 8.2) or the physics of static classic black holes (see § 8.7) and therefore it 
is one of the most important solutions in general relativity with many instantiations and applications. In 
fact, most of the investigations in general relativity are based on or related to the Schwarzschild metric. 

The Schwarzschild solution is a spherically symmetrical and time independent metric that describes the 
geometry of spacetime exterior to a static, spherically symmetric and electrically uncharged gravitating 
body of mass M surrounded by empty space. It should be remarked that “static” here means that the 
object does not vary in space or/and time (e.g. by translation or rotation in space or by changing its 
mass density or distribution in time) while “spherically symmetric” means that its mass distribution has 
a spherical symmetry and hence its mass density is a function of the radial coordinate r alone (i.e. with 
no dependency on 6 or ¢).!!48l Also, “empty space” means space void of energy (e.g. electromagnetic) as 


1148] Th fact, our use of “static” here is rather different from the common use of this term in the literature of general relativity 
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well as matter. We should also note that “spherically symmetric” condition suggests that the object has a 
finite size and hence it may exclude point-like objects (even as approximation) or black holes if they are 
considered real physical singularities. So, to avoid any potential challenges or disputes we can generalize 
this by claiming that spherical symmetry includes even geometric points (i.e. these points are spherically 
symmetric). 

It should be obvious that “exterior” (associated with “surrounded by empty space”) in the above descrip- 
tion means that the metric is a vacuum solution, i.e. it corresponds to a region of spacetime where the 
energy-momentum tensor vanishes, that is T,,, = 0. Accordingly, the Field Equation reduces to Gy, = 0 
and hence R,,, = 0 (noting that G,, = 0 iff R,, = 0 as demonstrated in exercise 7 of § 2.13; also see 
exercises 8 and 9 of § 3.2). In brief, the solution corresponds to the spacetime region where R,,, = 0. 
As will be stated later, we are assuming zero cosmological constant (see § 3.3). In fact, we state this for 
formality and clarity; otherwise the problem is not cosmological and hence the cosmological term is not 
supposed to exist in this context (or at least it is utterly negligible). 


4.1.1 Derivation of Schwarzschild Metric 


In this subsection we present a rather simple derivation of the Schwarzschild metric. However, before that 

we need to draw the attention to the following points about the solution and how it is obtained: 

1. As indicated earlier, obtaining a solution means obtaining the metric of the spacetime. So, our objective 
in this subsection is to obtain the metric tensor (as well as the line element which is based on the metric 
tensor) of the spacetime of the Schwarzschild problem (i.e. static, spherically symmetric and electrically 
uncharged gravitating body etc.). 

2. The procedure that is followed in obtaining the solution is to start with a guess about the generic 
form of the metric tensor (or line element). This guess is then used with some assumptions, insights 
and guidance by the classical gravitation theory and special relativity to obtain the final solution. As 
we will see, the main assumptions and insights are the spherical symmetry and time independence of 
the solution (which are essentially justified by the spherical symmetry and time independence of the 
source of gravity although we should take account of the upcoming Birkhoff theorem).!!49l 

3. From a physical perspective, the Schwarzschild object is characterized by a single physical parameter 
which is its mass.!45° Hence, from this perspective the Schwarzschild metric must be a function of the 
mass only. Also, from a geometric perspective, the metric tensor in general is a function of spacetime 
coordinates and hence in principle the Schwarzschild metric must be a function of these coordinates 
as well. So in brief, the Schwarzschild metric must be a function of the mass and coordinates. 

4, We use a spherical coordinate system (centered on the center of the gravitating body) for coordinating 
the spatial part of the spacetimel!!®4] and hence the coordinates of any point in the spacetime are 
given by (ct,r,@,@) which are commonly called the Schwarzschild coordinates.!!©?]_ The justification 
of this choice is the spherical symmetry of the problem which makes this choice intuitive. In brief, the 
spherical symmetry of the gravity source (i.e. gravitating body) implies the spherical symmetry of the 
metric tensor and this justifies the desirability of using a spherical coordinate system for coordinating 
the spatial part of the spacetime. 


which means time independent (or stationary in time) and reversible in time. However, our meaning is not far from 
this. We should also note that r, 0, refer effectively to spherical coordinates centered on the gravitating object (as will 
be clarified further later on). It is also important to note that certain restrictions imposed by “static” condition will be 
lifted by the upcoming Birkhoff theorem. 

The essence of the Birkhoff theorem is that all spherically symmetric gravitational fields in vacuum are static (i.e. even 
if the source of gravity is not static) and they are uniquely given by the Schwarzschild metric. In fact, it may also be 
stated as: all gravitational fields in vacuum generated by spherically symmetric gravitating objects are static (even if 
the objects are not static). However, this statement is less general. 


149 


150] Tn fact, mass here should include energy (due to the mass-energy equivalence) as long as the other conditions of the 


Schwarzschild problem are respected. 
151] Tn fact, this system applies strictly only in the regions of space that are very far from the gravitating object where the 
spacetime becomes flat. This will be clarified later. 

We note that the Schwarzschild metric can be obtained using other coordinate systems, e.g. Eddington-Finkelstein 
coordinates. However, in this book all the coordinate-specific investigations of the Schwarzschild metric are based on 


the use of Schwarzschild coordinates. 
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5. Because of the presumed spherical symmetry, the Schwarzschild metric should be independent of the 
spatial coordinates @ and ¢ (i.e. independent of orientation and direction relative to the center). For 
the same reason (i.e. spherical symmetry) the line element should be reversible in 6 and ¢ and hence 
it should not vary by the transformations dé ~+ —dé@ and d¢ 4 —d¢. 

6. The physical setting of the Schwarzschild problem dictates that the Schwarzschild metric should be 
independent of time. This is justified by the fact that according to the presumed physical setting and 
the stated assumptions of this problem there is nothing that varies in time and hence the metric must 
not depend on time. We should also assume that the Schwarzschild line element (which is based on the 
Schwarzschild metric) does not vary by reversing the sign of time (i.e. replacing dt with —dt) because 
the line element should be reversible in time since the object is static and the sign of time in this case 
is rather arbitrary. In fact, the reversibility in time is mainly a demonstration of the fact that the 
object is not rotating (as compared for example to the upcoming Kerr solution; see § 4.2). 

7. To sum up, the Schwarzschild metric tensor should be independent of the coordinates ct, #,@ and hence 
it should be a function of r and M only (where M is the mass of the gravitating object). Moreover, the 
line element should be reversible in t, 0, and hence it should be invariant under the transformations 
dt ~ —dt, dd + —d@ and dd © —d¢. 

8. As the distance from the gravitating body increases the curvature of the spacetime diminishes pro- 

gressively, and hence at very large distance from the gravitating body (i.e. r — oo) the spacetime 
is effectively flat. Therefore, at regions very far away from the gravitating object the Schwarzschild 
metric should converge asymptotically to the Lorentz metricl!®*! of special relativity (see § 2.5). Sim- 
ilarly, as the mass of the gravitating object M decreases the curvature of the spacetime diminishes 
progressively, and hence as the mass tends to zero (i.e. M — 0) the spacetime becomes effectively 
flat Minkowskian. Therefore, when the mass vanishes (i.e. M = 0) the Schwarzschild metric should 
converge to the Lorentz metric of special relativity. 
In brief, the expected metric should converge to the Lorentz metric of special relativity in certain 
limiting cases, i.e. the case of distant regions (r + oo) and the case of vanishing mass (IM = 0). These 
limiting cases provide guiding conditions during our derivation of the Schwarzschild metric. Moreover, 
any metric that we obtain as a solution to the Schwarzschild problem must yield the Lorentz metric 
of special relativity in these limiting cases and hence the solution must be rejected if it does not pass 
these tests (see exercises 6 and 7). 

9. We consider the Field Equation without the cosmological term since this term is not supposed to exist 
in this type of problems (i.e. non-cosmological). This can also be understood from the aforementioned 
fact that we are solving the following Field Equation: R,, = 0. 


To derive the Schwarzschild metric, we start from the Field Equation R,,, = 0 since the problem is a 
vacuum problem. Based on the insight that we gained from the above points we can guess that the 
Schwarzschild quadratic form in its generic and most general form must be given by the following expres- 
sion: 


(do)” = goo (edt)” + gi (dr)” + g22 (d0)” + 933 (do)? 
= fo (cdt)? — fy (dr)? — r? (dd)? — r? sin? 6 (dd)? (121) 


where fo and f; are real positive functions of r and M only that should converge to 1 as r + oo (due 
to the boundary condition at infinity, i.e. the convergence to the Lorentz metric of special relativity at 
infinity) and as M — 0 (due to the demand that the Schwarzschild metric should reduce to the Lorentz 
metric when the spacetime is void). 

Now, at this point we need to provide a number of justifications for the above form which looks restricted 
and not as general as we claimed: 
(a) The justification for the + — — — sign pattern of the terms in line 2 is that we expect the metric 
to converge to the Lorentz metric of special relativity (which has the + — —— sign pattern) at certain 
limiting cases (i.e. distant regions r + oo and vanishing mass M = 0) and hence the Schwarzschild metric 


[153] We use “Lorentz metric” to be more general since “Minkowski metric” may be restricted to the form diag [1,-1, -1, -]] 
and its variants (like diag [—1, 1,1, 1]) which are based on using Cartesian system for coordinating the spatial part. 
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should resemble the Lorentz metric in this sign pattern (or signature) .!154] This should also justify the 
requirement that fo and f are real positive functions that converge to 1 as r > oo and as M > 0. 

(b) The justification for fo and f; to be functions of r and M only has been outlined earlier, i.e. the 
time independence of the physical setting (and hence fp and f; are independent of t) and its spherical 
symmetry (and hence fo and f; are independent of @ and ¢) as well as the sole physical characterization 
of the gravity source by its mass M. 

(c) The justification for the form of go2 and 33 (i.e. go2 = —r? and g33 = —r? sin? 8) is the use of spherical 
coordinates due to the presumed spherical symmetry (as well as the convergence to the Lorentz metric of 
special relativity at the above limits). 

(d) The justification for the absence of the mixed differential terms (i.e. terms involving dxdx” where 
p # v) is the fact that the mixed terms that involve time differential (i.e. dt) are zero because the 
line element is reversible in time and hence it should be independent of the sign of dt, while the mixed 
differential terms that do not involve time differential (i.e. those involving only space differentials such as 
drd@) are zero because the line element must be independent of the sign of d@ and dé due to the spherical 
symmetry of the problem. In brief, the Schwarzschild line element has no mixed differential terms and 
hence the Schwarzschild metric should be diagonal. 

So, what we need in order to have a completely definite form of the Schwarzschild metric is to obtain 
the form of fo and f;. It can be argued that if we have to keep the above sign pattern and satisfy the 
boundary condition at infinity then we need to use exponential functions (which are non-negative), i.e. 
fo = e# and f; = e* where H and K are functions of r only!!®5! that should vanish at infinity (i.e. 
H = K =0 as r — ov) as required by the indicated boundary condition at infinity (i.e. convergence to 
the Lorentz metric at this limit).!5°1 Accordingly, we should have (refer to Eq. 121): 


[Quy] = diag [e™, —e*, —r?, —r* sin? 6] (122) 
[eM] =) digg (e-" psec") 8-7 Asin (123) 


As stated before, we are supposed to solve the tensorial equation R,,, = 0 and hence we need to find the 
coefficients of the Ricci curvature tensor R,,,, set them to zero and solve the resulting system of equations 
Ryv = 0 simultaneously to obtain the Schwarzschild metric in its final and definite form (by determining 
the functions H and Kk). Now, the coefficients of the Ricci curvature tensor are given by (see Eq. 89): 


= Qa a B a B a 

i= ON hy = Ogle Lat By = aol poy (124) 

Hence, we need to find the Christoffel symbols first where these symbols are given by (see Eq. 52): 
ae 
Cy = ae (Orgue Te OnJva = OaGuv) (125) 
Accordingly, our plan in order to obtain the definite form of the Schwarzschild metric is to obtain the 
Christoffel symbols using Eq. 125 with the aid of Eqs. 122 and 123. We then find the coefficients of 
the Ricci curvature tensor R,,, using Eq. 124. We then form the system of equations R,,, = 0 and solve 
them simultaneously (subject to the aforementioned assumptions and boundary conditions) to obtain 


the Schwarzschild metric in its definite form by identifying the functions H(r) and K(r). In fact, there 


[154] “Sionature” in this context means the number of positive metric coefficients (or terms of quadratic form) minus the 


number of negative metric coefficients noting that the metric is diagonal. So, according to the sign pattern + — — — the 
signature is 1 — 3 = —2 while according to the sign pattern — +++ the signature is 3— 1 = 2. 

1155] From now on, we ignore the dependency of fo and f1 (and hence the dependency of H and K) on M because for a given 
Schwarzschild object the mass is fixed (i.e. M = constant) and hence fp and f; are functions of r only although from 
the point of view of a general metric they are functions of both r and M. As we will see, M will be incorporated in the 
metric through a constant of integration with the help of the classical limit guidance, and hence M will be embedded 
into H and K (and hence fo and f;) as a constant. So we may say: H and K depend on M as a constant and on r as 
a variable (noting that H and K should vanish as M = 0 and as r - oo). 

1156] The choice of exponential functions may be additionally justified by being a convenient form to work with. Anyway, 
the exponential form is an acceptable form to use according to the imposed conditions and requirements and hence it 
does not require further justification. 
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are other plans for obtaining the definite form of the Schwarzschild metric such as solving the tensorial 
equation G,,, = 0 directly instead of solving the equation R,,, = 0 (which is equivalent to the equation 
Gv = 0 as seen in exercise 7 of § 2.13). However, in our view the method that we follow is the clearest 
and most straightforward although the other methods may also have some advantages. 

Regarding the Christoffel symbols, it can be shown (see exercise 11) that the non-vanishing Christoffel 
symbols are only: 


0 _ po —t1ip 
Vor =Vo = gt 


Ti, = de#-* A ri, = 4K’ Ti, = —re-* Th, = —re~* sin? 6 
12 
ie ee 13, = —sin@cos0 i 
to aa 
Ms =T3. = 5 P33 = P32 = cot d 


where the prime means total derivative with respect to r (i.e. d/dr). It can also be shown (see exercise 
12) that the non-vanishing coefficients of the Ricci curvature tensor are only the diagonal elements, that 
is: 


H'K' (H')? HH" H! 
Roo = aa oe ; =| (127) 
Ry = as ay r x (128) 
Ro = e * p+ (129) 
R33 = sin? fe“ ji | - 1 (130) 


Now, solving the system of equations R,,, = 0 simultaneously means solving the system of the last four 
coefficients simultaneously (since the other coefficients will produce the trivial equation 0 = 0), that is: 


H'K! (H')? H" H! 
H-K = 131 
: ( 4 4 2 e sy 
H'K! (H’)? H" K! 
Fi t 1 + 5) FA = 0 (132) 
H'— kK’ 
eK i me es Ah SG (133) 
1 oq 
sin? fe“ c + | - i} =. (134) 


Now, since e?—* ¥ 0 and sin? @ is not identically zero!!®7| then the last four equations will reduce to: 


H'K! (H’)? HY" H’ 


— 1 
4 4 2 Tr a 85) 
H'K! (H')? H" K! 
= 1 
4 4 2 r g oe) 
H'— Kk’ 
eK f gas ; ’ oe (137) 
H'— Kk’ 
eK p+ yy, 2:9 (138) 


[157] Ty fact, sin? 6 is zero only at the poles which should be excluded. 
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As we see, the last two equations are identical and hence the system will reduce to the following three 
equations: 


H'K! (H’)? HY" H’ 


1 a a al . (139) 
H'K’ (HY? HY KI 
ra ( a t>-= = 0 (140) 
eK f ! ar =| aoe (141) 
Now, if we add the first two equations we get:!!5°| 
2 a = x =0 that is (Pa (142) 


and hence on integrating both sides with respect to r (noting that H and K are functions of r only) we 
get: 
H=-K+C (143) 


where C is the constant of integration. Now, since H and K should converge to zero at infinity (according 
to the aforementioned infinity boundary condition)!!>°! then C = 0 and we get: 


Rok (144) 


On substituting from Eqs. 142 and 144 into Eq. 141 we get: 


ee i+ = 0 (145) 
ef? (14+rH’) = 1 (146) 


It can be shown (see exercise 13) that this differential equation has the following solution: 


er 


Tr 


ef =1- (147) 
where C, is a constant. By using the Newtonian gravity theory (see § 1.5) in conjunction with the 
correspondence principle (see § 1.8.3) at the classical gravity limit it can be shown (see the upcoming notes 
in the end of this subsection) that C, is equal to the Schwarzschild radius!!! (ie. C, = Rs = 7S"), 
and hence: 


IGM 
Mf = 1- (148) 
Cor 
1 IGM \~" 
K —H 
= he 149 
e 7 ef ( er ) 9) 


On substituting from the last two equations into Eqs. 122 and 123 we get the Schwarzschild metric in its 
definite form, that is: 


=a 
[9] = diag (2 oe) (1 S| a eae Panta (150) 


cr Cr 


[158] We note that r # 0 because the problem is a vacuum problem in a region exterior to the gravitating body which is 
positioned at r = 0. 

[159] We may also use the condition H = K =O as M=0. 

[160] The fact that Cs is equal to the Schwarzschild radius (which has the dimension of length) is consistent with the 
requirement that C’;/r must be dimensionless since r has the dimension of length. For the definition of Schwarzschild 
radius, see exercise 14. 


4.1.1 Derivation of Schwarzschild Metric 109 


-1 
[gh] = diag (2 —) (1 —) r 2 roe sin72 fa) (151) 


cr cr 


The last two equations represent the Schwarzschild metric tensor in its covariant and contravariant definite 
form, i.e. it is the required solution of the Schwarzschild problem. 

Finally, we should draw the attention to the following important notes: 
e The above derivation is based on several simplifications and restrictions; moreover the final solution was 
in sight (at least in its form) prior to the derivation since we started with a given guess about how the 
solution should look like. Therefore, it might be suspected that the obtained solution is not general and 
a different solution might have been obtained if we followed a different course of action. However, despite 
the aforementioned limitations and preconception it can be shown that the obtained solution is general 
within the stated assumptions, i.e. all the static spherically symmetric solutions of the vacuum tensorial 
equation R,,, = 0 subject to the given boundary condition at infinity (as well as the stated physical setting 
and assumptions of the Schwarzschild problem) are equivalent to the obtained Schwarzschild solution. The 
interested reader should refer to the Birkhoff theorem in the literature of general relativity (noting that 
this theorem contains more generalizations than what we are using and demanding here).|64] 
e To obtain a specific expression for C, (see Eq. 147) and hence determine the definite form of the 
Schwarzschild metric completely, we use the fact that in the classical limit (see § 5.1) the component goo 
of the Schwarzschild metric tensor is approximated by: 


20 2GM 


~l14 =I 152 
Joo eo) Die (152) 


where in the second equality we use ® = —GM 


exercise 2 of § 1.5). Accordingly: 


with ® being the classical gravitational potential (see 


_ 2GM 


—— 


; (153) 


The details of this derivation are given in § 5.1. 

e Regarding the use of the spherical coordinate system to represent the spatial part of the spacetime and 
the meaning of the Schwarzschild coordinates in general (and indeed the interpretation of the coordinates 
in general relativity to be more general), we refer the reader to § 6.2, § 6.3.2 and § 6.4.2. 

e In the literature of general relativity there are some examples of Schwarzschild geometry generated by 
gravitating objects with non spherical symmetric matter distribution. If so, the spherical symmetry of 
the matter distribution is a sufficient but not necessary condition for the generation of Schwarzschild 
geometry (assuming that other conditions are satisfied) although the derivation of the metric in these 
examples should follow a different route since our derivation is based on the assumption of spherical 
symmetry of the source of gravity (in fact we characterized the source of gravity as a static, spherically 
symmetric and electrically uncharged gravitating body). It is worth noting that the reader should not 
confuse the spherical symmetry of the matter distribution of the gravitating object with the spherical 
symmetry of the solution (noting that in our argument in this subsection we based the latter on the 
former and considering that this should be consistent with being a sufficient but not necessary condition 
as stated above). 


Problems 


1. What about the metric of the spacetime inside the gravitating object of the Schwarzschild problem? 
Answer: The metric inside the object is different from the metric outside the object (where the metric 
outside is the Schwarzschild metric). The reason is that in the outside region the metric belongs to 
vacuum (i.e. T,, = 0 and hence R,,, = 0) while the metric inside the object is not (i.e. T,,, 4 0 and 
hence R,,, # 0 since R,,, = 0 iff T,,, = 0 according to exercise 9 of § 3.2). However, the two metrics 


[161] The Birkhoff theorem may be stated succinctly as follows: any solution of the vacuum Field Equation of a spherically 
symmetric (non-rotating) gravitating object is given by the Schwarzschild metric. This means that in the vacuum region 
the solution is static (where static here means time independent metric and time-reversible quadratic form) even if the 
gravitating object is not static (e.g. pulsating spherical-symmetrically). 
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should match at the boundary (ie. the surface of the gravitating object) to ensure the continuity of 
the metric in the entire spacetime. 
We note that in this answer we are assuming that the gravitating object has a finite size and hence the 
spacetime is divided into inside the object and outside the object. So, if we assume that the gravitating 
object is point-like then the Schwarzschild metric corresponds to the entire spacetime (excluding the 
point where the object is located). We should also note that the metric inside the object is not unique 
since it depends on the equation of state of the object and its physical and geometric properties (e.g. 
fluid or solid, of uniform density or not, etc.). 

2. Show that the Schwarzschild metric is time independent and spherically symmetric. 
Answer: The Schwarzschild metric tensor is given by: 


IGM 2GM\~" 
1 e ‘ (1 = ) ee Panta 


[dur] = diag Cr Cr 


As we see, there is no temporal variable in this metric and hence it is time independent. Moreover, 
all the components depend on r only (noting the constancy of M/) and hence it is spherically symmet- 
ric.!!62] We note that the dependence of the last component on 6 is accidental (due to the choice of 
coordinate system) and hence it does not affect the spherical symmetry of the physical situation (as 
can be verified by rotating the coordinate system or the gravitating object; also see exercise 6 of § 4.2). 
In brief, the physical situation dictates spherical symmetry despite this mathematical artifact which 
suggests otherwise. In fact, this should put a question mark on the inferred time independence and 
symmetry with respect to ¢ as mathematical consequences rather than physically inferred conclusions 
that are supported by the mathematical form of the metric.!!®! So, the final judgment belongs to 
physics which should validate or invalidate any feature and artifact suggested by the formalism of 
mathematics. 

Note 1: from a formal perspective the Schwarzschild metric!!®4! is not ideal since it has several mis- 
leading artifacts and features related to the choice of coordinates such as this ostensible spherical 
non-symmetry and the accidental singularity at the Schwarzschild radius (which will be investigated 
later). This should highlight the issue (which we advocate continuously) that mathematics is not such 
a perfect and ideal tool when used in physical sciences and hence we should be careful in reading and 
interpreting our mathematical formulations and models of physical phenomena (see § 1.10). 

Note 2: according to the circulating literature of black holes, the spacetime inside the event horizon 
of Schwarzschild black hole is neither time independent nor reversible in time (i.e. it is not stationary 
or static according to the common terminology). This is because nothing can stand still inside the 
horizon.!!®5! Accordingly, in the case of black holes the above-claimed time independence should be 
restricted to outside the event horizon. However, the interpretations and consequences of some of 
these issues should be treated with caution due to the many fantasies and questionable issues in the 
circulating physics of black holes. In fact, we may even question the meaning of time dependency and 
irreversibility in time in these claims although we do not discuss this issue due to its pedantic and 
hypothetical nature. 


Exercises 
1. What is the Schwarzschild solution? 


2. Summarize the main characteristics of the Schwarzschild solution. 
3. Summarize the procedure that we followed to find the Schwarzschild metric. 


[162] Being spherically symmetric means it is independent of 6 and ¢. 

[163] Ty fact, the mathematical form is obtained based on the above physical considerations and assumptions and hence the 

mathematical form actually reflects the physics (rather than being suggestive of the physics). 

“Schwarzschild metric” here should mean the solution of the Schwarzschild problem that is based on the use of the 

Schwarzschild coordinate system. In other words, it is “Schwarzschild metric in the Schwarzschild coordinates”. 

[165] This is also linked to the exchange of the roles of time coordinate and radial coordinate and the change of their 
nature where t becomes spacelike and r becomes timelike and hence the metric will have time dependency through its 
dependency on r. The inevitability of moving toward the singularity inside the horizon is attributed to the inevitability 
of moving in time toward the future since r inside the horizon is timelike. 


[164] 
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19. 


20. 


. Write down the Schwarzschild metric tensor and the Schwarzschild quadratic form explaining all the 


symbols used. 


. Write the Schwarzschild metric tensor in terms of the escape speed ve, and in terms of the Schwarzschild 


radius Rg. 


. Verify that the Schwarzschild metric converges to the Lorentz metric of special relativity very far away 


from the gravitating object. 


. Verify that when the mass vanishes the Schwarzschild metric converges to the Lorentz metric of special 


relativity. 


. Based on the last two exercises, analyze the relation between special relativity and general relativity 


and how the latter is seen to converge to the former in the absence of gravity. 


. What are the types of singularity and how you define them? 
. Show that there is a singularity in the Schwarzschild quadratic form. What is the physical interpreta- 


tion of this? What is the type of this singularity? 


. Show that the non-vanishing Christoffel symbols of the Schwarzschild spacetime (in the Schwarzschild 


coordinates) are those given by Eq. 126 and verify that these symbols are as given by Eq. 126. 


. Show that the non-vanishing coefficients of the Ricci curvature tensor of the Schwarzschild spacetime 


(in the Schwarzschild coordinates) are Roo, R11, Reo and R33 and verify that these coefficients are as 
given by Eqs. 127-130. 


. Show that the differential equation e” (1 +1rH’) = 1 has the following solution: e” = 1— ©: where 


C’, is a constant and the prime represents derivative with respect to r. 


. What is Schwarzschild radius? 
. Verify that the solution found in exercise 13 satisfies the system given by Eqs. 139-141. What this 


means? 


. Show that the definite form of the Schwarzschild metric that we obtained in the text (Eqs. 150-151) 


satisfies the vacuum Field Equations R,,,, = 0. 


. Obtain the non-vanishing Christoffel symbols of the Schwarzschild metric in their definite form (refer 


to Eq. 126). 


. Use the Schwarzschild metric to obtain a mathematical condition for the weak field approximation (i.e. 


where Newtonian gravity is a good approximation to general relativity). 

In § 1.3 we defined weak gravitational field as a field in which the gravitational potential energy of 
a test particle is negligible in comparison to its mass energy. Try to correlate this definition to the 
mathematical condition that is obtained in the previous exercise. 

What is the essence of the Birkhoff theorem? 
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On using the Christoffel symbols that were derived in § 4.1.1 it can be shown (see Problems) that the four 
components of the tensorial geodesic equation in the Schwarzschild metric are: 


dt _ 2GM dt dr 


De ie ava. ft (154) 
2 2 2 2 
wet Sr (5) ~aeer (GB) (GR) ~rPamte(G) = 9 8 
a ee sindooso (2) = 0 (156) 
rr 


where F = 1 — 2G . It should be obvious that the tensorial geodesic equation is a rank-1 tensor (i.e. 
vector) equation in 4D spacetime and hence it should have four components. 


We will also show in the Problems the following results: 
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(A) The above system of geodesic equations (i.e. Eqs. 154-157) can be reduced to the following system 
of equations: 


FH =A 1 
a (158) 
du GMA2 GM (/du7\?’ 
ip BEF + paar ( ~) +Fu = 0 (159) 
dp 2 
jx = Bu (160) 


where A and B are constants and u = 1/r.|16¢ 


(B) For massive objects the reduced system of Eqs. 158-160 can be simplified to the following system of 
equations: 


dt 
Hee i (161) 
du GM 3GMu? 
de +uU = Bz + 72 (162) 
do 2 
-= u (163) 


where we use the proper time parameter 7 as affine parameter. 


(C) For massless objects the reduced system of Eqs. 158-160 can be simplified to the following system of 
equations: 


dt 
Fo = A (164) 
du 3G Mu? 
aga = 2 (165) 
“ a Bi (166) 


Problems 


1. Show that the four components of the tensorial geodesic equation in the Schwarzschild metric are as 
given by Eqs. 154-157. 
Answer: The tensorial geodesic equation is given by: 


cee re dx® dat _ 
dd2 ° PY dy dy 


Now, if we symbolize (1 - 2G) with F and use the results of exercise 17 of § 4.1.1 then we have: 
e For a = 0 the non-vanishing Christoffel symbols are I’, and ['},. Hence: 


d?x° ne dx® dx! a0 dx! dx® ai 
ane hay dx ad 
d?x° eee dx® dx! 7 

dd2 | Pou d\ dk o 

d?x° a GM dx® dz! cag 
dd? | @r2F dd dd 

d? (ct) 2GM d(ct)dr 0 


dd? aaa dX dr 


[166] We should note that A here is not the same as A in the orbital shape equations of § 1.5.1 (see for example Eq. 17) 
although both are constants. 
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ah ,2GM ddr _ 
d\2° cr2Fd\d\ 
which is Eq. 154. 
e For a = 1 the non-vanishing Christoffel symbols are jg, ['j,, [4 and ['4,. Hence: 
d?x} Tr dx® dx® opi dz! dx! ag dx? dx? jp dx? dx? ar, 
dd? OO dx ad Gk. OPEN cao) a) nn 
@z! GMF (dx®\? GM (dzx\’ dx? \” dx3\” 
F Fsin?96(—) = 0 
ae" eer? (=) sap (Gr) i (=) ae (=) 
@r GMF (dt\* GM (dr \’ do \” do\? 
F F sin? 6 (| — = 
a2 2 (=) CrP (5) e (=) 7s (F) 
which is Eq. 155. 
e For a = 2 the non-vanishing Christoffel symbols are '7,, 3, and 33. Hence: 
d? x? a dx! dx? — dx? dx sida dae é 
i) Cas Oe) Ras Fae 3 dk dA 
dx? Py dx! dx? 2 dx? dx? _ 9 
aye ea aX Say dy 
de 2 dar! dat sind cos = 0 
d\2 " r dX dX 7) 
d20 2dr d0 do\? 
in 0 cos 0 = 
Tee ($) 
which is Eq. 156. 
e For a = 3 the non-vanishing Christoffel symbols are '?,, [3,, [33 and ['3,. Hence: 
dx? 2 oi dx! dx? acy dx? dx! aig dx? dx? oe dx? dx? _ 9 
Ge EN Oe Pe Da aa 
d? x? dx! dx? dx? dx? 
+ 2r3 3,—-—- = 0 
ae Gy ae) aN 
ax? 2dx' dx - gd dx? _ 9 
ae eddy eke a 
ao 2drdd do dd 
ie eae : 
which is Eq. 157. 
2. Show that the system of Eqs. 154-157 can be reduced to the system of Eqs. 158-160. 
Answer: We start from the system of Eqs. 154-157, that is: 
dt  22GM dt dr 
= 1 
dd? cr? F d\ dX ey) 
@r GMF (dt\* GM (adr ’ do \? do\? 
: F F sin? 6 ( — = 1 
ae re (5) Op (=) (5) oe (3) H mee) 
a0 2drdo | do\? 
D2 + = A sind coso ($5) = 0 (169) 
@2h 2drdd do dd 
a ae eo eoto—— = 1 
De peas a : ro) 


In the following points we show step by step how the system of Eqs. 167-170 can be reduced to the 
system of Eqs. 158-160. 
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(a) Eliminating Eq. 169: if we assume that initially the gravitated object is in the plane 0 = 7/2 and 
the first order variation of its 9 dependency vanishes (and hence initially cos@ = 0 and ap = 0; refer 
to the upcoming note) then Eq. 169 becomes: 

d?0 

dy? 
On differentiating this equation again and again we conclude that the A-derivatives of 6 of all orders 
are zero (i.e. ae ae --- = 0) and hence @ has no dependency on the affine parameter 4.!!67 
This means that @ is constant and hence if the gravitated object moves initially in the plane 6 = 1/2 
then it will stay in this plane permanently. This result is consistent with the classical mechanics where 
the trajectory of a gravitated object in a central force field is confined to a plane which is determined 
by the initial conditions, i.e. the position and velocity of the gravitated object (with A standing for 
time). Accordingly, Eq. 169 will reduce to the simple condition 6 = 7/2 and hence the system of Eqs. 
167-170 will reduce to the following three equations (noting that a = 0, sin? 6 = 1 and cot @ = 0 since 
0 = 71/2): 


=0 


@t  2GM dt dr 


i @earaa (171) 
@r GMF (dt\* GM (dr ? do\? 
pole (=) ork (=) F (3) = re) 
Po 2dr dd 
ao 2drdpe _ 1 
Be a a a) 


(b) Simplifying Eq. 171: if we note that 244% = (since F = 1— 2S“) then Eq. 171 can be 
simplified as follows: 


@t  1dFdtdr 


De Raa 
@t idk dt —_ 0 
ie" F dvds 

@t dF dt —_ 0 

oe didi 

d (_dt 

5 (FS) pata 
dt 
ae | 
dy 


where in line 2 we use the chain rule, in line 3 we multiply by F, in line 4 we use the product rule, and 
in line 5 we integrate with respect to » (with A being a constant). Accordingly, the system of Eqs. 
171-173 becomes: 


dt 
F— =A 174 
dX re) 
@r GMF (dt\* GM (dr ’ do\? 
F = 1 
de! 7 (=) Cre F (=) ‘ ($) : ae) 
2d 2dr dd 
ee ee 1 
Mo son ue 
(c) Simplifying Eq. 176: we have: 
#6 , 2dr do 
dX?" rd\d\ 


267] I fact, this may be expressed more technically and elegantly in terms of a Taylor series. In brief, the essential point 
in the above argument is that since 6 and all its derivatives vanish initially then it (with its derivatives) should vanish 
permanently (which is formally demonstrated by a Taylor expansion about the initial point). 
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Eb. Grae 
2 
—— ss » —_ —— — 
Ope a ° 
d ( .dp~\ _ 
ara) = 9 
do 
2 
BEY ee IB 
Peak 


where in line 2 we multiply by r?, in line 3 we use the product rule, and in line 4 we integrate with 
respect to A (with B being a constant). Accordingly, the system of Eqs. 174-176 becomes: 


dt 
Fy = A (177) 
@r GMF (dt\* GM (adr \’ do\? 
F = 1 
geo” og (=) OF (=) ‘ (F) : no) 
pe 5 i (179) 
(d) Simplifying Eq. 178: if we substitute from Eqs. 177 and 179 into Eq. 178 we get: 
@r GMF(A\*? GM (dr\? B\* 
che rP = 0 
dd? r2 F cr2F \ dx r2 
Pr | GMA? GM (dr\* FB 
d\? °F Cr2F \ dr rs 
Accordingly, the system of Eqs. 177-179 becomes: 
dt 
FY = A (180) 
dr GMA? GM (dr\* FB? 
wet rk Crk (=) Po ey) 
d 
pe = 8 (182) 
(e) Developing expression for ae by the chain and product rules we have: 
dr dr do 
@r dr (db\? | dr do (184) 
d\2 ~—s dg? \ dX do dr? 


Now: 


@> _ d (dd\_ ad (B\_a(B\dr __ Bar 
a2 dd \d\)~ d\ \r?) ~~ dr \7?) dX 3 dd 


where Eq. 182 is used in step 2 and the chain rule is used in step 3. On substituting from the last 
equation into Eq. 184 we get: 


ee @r (dd\* .Bdrdr 
D2 dd? \dr 73 dd dé 


— @r (BY? 9B (adr de ar 
dd? (2) re G DS do 
_ Be@r .B(dr\* B 

r* dd? r3 ($) 2 


r 
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. Ber. 2B? far \ 
rt dd? r> \d@d 


where in line 2 we substitute from Eqs. 182 and 183, and in line 3 we substitute from Eq. 182. On 
substituting from the last equation into Eq. 181 we get: 


B? dr 2B? (dr\* | GMA? GM /(dr\’_ FB? _, 
r* dd? r> \d¢d) ° PF cr?F \ dA rs 
Accordingly, the system of Eqs. 180-182 becomes: 
dt 
Ba = A 185 
B? Pr 2B? (dr\* | GMA? GM (dr\* FB? _ (186) 
r* dd? r? \dé rk cr2F \ dr rs 
de 
an = 
er B (187) 


(f) Introducing new dependent variable u: if we introduce a new dependent variable u = 1/r (and 
hence r = 1/u) then we have: 


dr 1 du 


dé —uw2d@ tee) 
d?r 2 (du\* 1 du 
deze us (F) u2 de (189) 


1 du\? GMA?u2— GMu2 (du7!\’ 
2,5 Pit peated 
2B*u ( a5) + 2 ( ) FBew = 0 


2 2 oked 
2B. (5) Bee 2B. (5) GM A2u2 GMu? (S -) FBR = 0 


do dd? do F Cr dX 
@u GMA2u2 GMu? (du-\? 
B? 2 FB? 3 <— 
“UP F OF ( dd ) - : 


+Fu = 0O 


du GMA? | GM (du71\* 
dé? BF | Be2F \ dx 


where in equality 1 we substitute from Eqs. 188 and 189 and use u = 1/r, in equalities 2 and 3 we 
simplify, and in equality 4 we divide by —B?u?. Accordingly, the system of Eqs. 185-187 becomes: 


Fo = A (190) 
@u GMA? GM (du-!\? 
Fu = 191 
de Br t aa ( dy ) - uy) 
de 2 
Be es, 2B 192 


as required. 

Note: although the assumption that the gravitated object is initially in the plane 6 = 7/2 is justifiable 
(since we can choose our coordinate system in such a way that satisfies this condition) the assumption 
that the first derivative of @ vanishes (ie. 4% = 0) may not be obvious. Yes, this may be justified by 
the use of classical mechanics where orbits in central force field are shown to be planar (or equivalently 
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by employing the conservation of angular momentum) although this is much stronger condition (since 
it will make variations of all orders vanish) and it will make the rest of our argument redundant. 
However, since the initial point is arbitrary it can be made to satisfy both conditions (i.e. 0 = 1/2 
and a = 0) although the general validity of this may not be obvious. 
We should also note that due to the spherical symmetry of the physical setting the restriction of the 
motion to the equatorial plane (ie. @ = 2/2 condition) does not impose restriction on the general 
validity of the obtained geodesic equations since the equatorial plane is arbitrary and can be any plane 
that passes through the center of the gravitating body. 

3. Show that for massive objects the system of Eqs. 158-160 can be simplified to the system of Eqs. 
161-163. 
Answer: Massive objects follow timelike geodesics and hence the proper time parameter 7 can be 
used as an affine parameter in the geodesic equations, i.e. Eqs. 158-160 (see problem 1 of § 2.9.5). So, 


we have: 
po = A (193) 
dt 
du GMA? GM (/du-\? 
if  B°F Bae ( =) Rag Penne ae) 
dp 2 
a, = «Bu (195) 
Now, from the Schwarzschild quadratic form we have: 
(do)? = F(cdt)? — F~! (dr)? — r? (d0)* — r? sin? 6 (dd) 
(do)? = F(cdt)? — F~' (dr)? — r? (dd)? 
do)” Fol r2 
dey = easy? — 7 (any? — 5 (a8 
Cc C C 
3 = 2 =e fae 2 
(dr)" = F (dt) 2 (dr) x (dd) 


dt\? F-! (dr\* r? (db \? 
1 = F 1 
(=) C2 (F) Cc (3) (198) 
2 2 2 
(+) = (FF? (+) Fr? (2) CF 
T T T 
du-!\? gh PREY AP db) 5 
( dt ) = ee (=) u? (3) ae 


ine \: 2 42 2,2 2 
= CA*—B*Fu'-cF 
dt 
where line 1 is the Schwarzschild quadratic form with F = 1 — 2GM , in line 2 we use dO = O and 


sin? 0 = 1 since 0 = 7/2 (see problem 2), in line 3 we divide by c?, in line 4 we use the relation between 
o and 7, in line 5 we divide by (dr), in line 6 we multiply by c?F with some algebraic manipulation, 
in line 7 we use r = 1/u, and in line 8 we substitute from Eqs. 193 and 195 and simplify. On inserting 


du_ 
dr 


2 
the expression of ( *) from the last equation into Eq. 194 we get: 


du GMA? | GM 


242 2,2 2 = 
Ie BF erage (CA B°Fu —cF)+Fu = 0 


@u GMuw GM 
de a B2 
@u GMuw GM ( —) 
+ 1 U 


Fu = 0 


| 
ro) 


dd? (od B? Cc 
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fu GMuw GM 2GMu? 
dee @ geo ce = 0 
d7u ~GM 3G Mu? 
aT 7a a a 
d?u a GM  3GMu? 
ed SRAL hy, spat, ne GNS 
d¢? B? Cc? 
where in line 3 we use F = 1 — aoa Hence, the system of Eqs. 193-195 becomes: 
dt 
pens ae A, 
dt 
du GM 3GMu? 
dg? es Bz _ Cc? 
do 2 
Yodth. fl ont “NED 
dt a 


as required. 

4. Show that for massless objects the system of Eqs. 158-160 can be simplified to the system of Eqs. 
164-166. 
Answer: Massless objects follow lightlike geodesics (or null geodesics) and hence their quadratic form 
is zero, i.e. (do)? = 0. So, from the quadratic form of the Schwarzschild metric we have: 


0 = F(edt)? — F7! (dr)? —r? (d0)* — r? sin? 6 (dd)? 
0 = F (edt)? — F~! (dr)? —r? (dd)? 
2 ae | 2 2 2 
0 = F dt F dr re (dp 
dn ce \dr ce \ dr 
2 = -1\ 2 2 
a. dt F du 1 dd 
dy CC dX c2u2 \ dr 
A2 Fl (du-!\?_— Bu? 
0 = 
F Ce ( dX ) C 
dua}\* 2 42 23,2 
( ir ) = cCA*—B*Fu 
where in line 1 we use the Schwarzschild quadratic form with F = 1 — 2GM , in line 2 we use d@ = 0 


and sin? @ = 1 since 9 = 1/2 (see problem 2), in line 3 we divide by c? (dA)”, in line 4 we use r = 1/u, 
in line 5 we substitute from Eqs. 158 and 160 and simplify, and in line 6 we multiply by Fc? and 


manipulate. On inserting the expression of (45) from the last equation into Eq. 159 we get: 


@u GMA? GM 


2 42 27,2 
Ie BF tap CA B?’ Fu’) + Fu = 0 
du GMu? 
dee — = +r Fu = 0 
du GMu? ’ 2GMu 
dee eo T 1 72 UU = 0 
Zu GMu? 2GMu? 
de 2 ru ce = 0 
du 3G Mu? 


d¢? or 
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du ir 3G Mu? 
pol SIN? yee aan ate 
dd? on 
where in line 3 we use F = 1 — aoe Hence, the system of Eqs. 158-160 becomes: 
dt 
F- = A 
dy 
d7u 3GMu? 
dd? oo 2 
do 2 
he is 5B 
dn = 
as required. 
Exercises 


1. Summarize the essence of the formalism and strategy of general relativity in determining the trajec- 
tories of massive and massless objects in the spacetime under the effect of gravity. Hence, justify the 
investigation of this subsection and its applications. 

2. Compare the system of Eqs. 161-163 to the system of Eqs. 164-166. 


4.2 Kerr Solution 


This is another exact analytical solution to the Field Equations of general relativity. The Kerr met- 
ric describes the geometry of empty spacetime outside a rotating “Schwarzschild object” (i.e. static, 
spherically symmetric and electrically neutral gravitating object)!/6*! and hence it is an extension to the 
Schwarzschild solution by adding rotation to the Schwarzschild object. Again, it is a vacuum solution 
(due to the “empty” attribute) and hence it is a solution to the tensorial equation R,,, = 0 (also see § 8.7 
and § 8.7.2). The quadratic form (do)* of the Kerr solution is given by: 


R 2 
(doy = (1 AS) ceary® — © (ary? = 6? (a0) 
p 
ae 2¢@ 2 : 2 
aoe G a2 4 Rsra = ) (do)? + aE do (197) 
p p 
where Rg is the “Schwarzschild radius” (ic. Ry = 74M), a = =| with J being the magnitude of the 


angular momentum, (ct,r,,) are the Boyer-Lindquist coordinates (see exercise 10), p? = a? cos? 6+ 
r?, and A =r?—rRg+.a?. Moreover, the axis of rotation is assumed to be the same as the polar axis of 
the spatial coordinates. 

Problems 


1. Show that the Kerr metric converges to the Lorentz metric of special relativity when r > co. 
Answer: We have: 


et ee, 
rc roo a? cos? 6 +r? roo (a?/1r) cos? 6 +r 

; 7 . @*cos?64r? : (a? /r?) cos? 0 +1 

lim — = lim ————~ = lim =1 
r—oo roo r2—rRg +a? rsoo ] — (Rs/r) + (a2/r2) 

lim p? = lim (a* cos? +r?) = lim r? ([a?/r?] cos? 041) =r? 
Too TCO Too 


[168] We note that what is required for the Kerr problem (and the upcoming Kerr-Newman problem) is axial symmetry of 
the gravitating object. Also, “static” excludes “rotating”. 
1169] The parameter a may be called the Kerr parameter or the rotation parameter. 
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2 a2 2.2 
@ 6 
lim ( py ene ) = lim r? (14 (a?/r?) + a 


p T—00 per 
Rga? sin? 6 
hae 8 272) | 219 
eat (1+ ( fr) (@cos®O+r2)r) | 


; 2Rsrasin? 0 ; 2Rsrasin2 @ . 2Rgasin? 6 
lim ——— => lim A PA ee — lim (@2/r) cos? 6-47 = 0 
TCO 


2 


roo p r—oo \ a? cos? 6 + r? a?/r) cos? 0 +r 
Hence: 
lim (do)? = (1-0) (cdt)? — 1 (dr)? — r? (d6)* — sin? 6 (r?) (dd)* +0 
TOO 


= (edt)? — (dr)? — r? (d0)” — r? sin? 6 (dd)? 


which is the Lorentz metric (or rather quadratic form which is based on the metric) of special relativity 
with spatial spherical coordinates (see exercise 10 of § 2.5). In fact, we should also assume that the 
Boyer-Lindquist coordinates reduce to the ordinary coordinates of flat space at infinity (which is the 
case according to the definition of the Boyer-Lindquist coordinates; see exercise 10). 


Exercises 


1. Summarize the main characteristics of the Kerr solution. 

2. Write down the Kerr metric tensor. 

3. Show that the Kerr solution converges to the Schwarzschild solution when the gravitating object is not 
rotating. 

Show that the Kerr metric converges to the Lorentz metric of special relativity when M = 0. 

Show that the Kerr metric is time independent. 

Show that the Kerr metric is axially symmetric but not spherically symmetric. 

Investigate the singularities of the Kerr solution. 

Is the Kerr line element reversible in time and why? 

Why is there a cdt dé mixed term in the Kerr line element? 

What are the Boyer-Lindquist coordinates? 

What is the difference between “static” and “stationary” (as part of the terminology used in the dis- 
cussion of space metric)? 


a i ha 


—_ao 


4.3. Reissner-Nordstrom Solution 


This is another exact analytical solution to the Field Equations of general relativity. The Reissner- 
Nordstrom metric describes the geometry of spacetime outside an electrically charged “Schwarzschild 
object” and hence it is an extension to the Schwarzschild solution by adding electric charge to the 
Schwarzschild object. However, it is not a vacuum solution because the surrounding space is filled with 
electrostatic field which should generate non-vanishing energy-momentum tensor (see exercise 5 of § 2.14). 
It is noteworthy that the Reissner-Nordstrom problem (as well as the upcoming Kerr-Newman problem) 
of an electrically charged gravitating object (which is generally presumed to be a black hole) has mainly 
theoretical value since the existence of such an electrically charged object is questionable in real astro- 
physical situations where charge (if it does exist initially) should neutralize (or almost neutralize) in a 
rather short time due to the interaction with other objects in the vicinity (e.g. gas or cosmic dust) or 
through emission (if emission is possible).!'7°! Therefore, we do not investigate this solution (and the 
upcoming Kerr-Newman solution) in detail. 

Exercises 


1. Summarize the main characteristics of the Reissner-Nordstrom solution. 


[170] On a large scale matter is electrically neutral. 
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4.4 Kerr-Newman Solution 


This is another exact analytical solution to the Field Equations of general relativity. The Kerr-Newman 
metric describes the geometry of spacetime outside a rotating and electrically charged “Schwarzschild 
object”4741 and hence it is an extension to the Schwarzschild solution by adding angular momentum and 
electric charge to the Schwarzschild object. Again, it is not a vacuum solution because the surrounding 
space is filled with electromagnetic field which should generate non-vanishing energy-momentum tensor. 


Problems 


1. The gravitating object of the Kerr-Newman solution is characterized physically by mass, angular 
momentum and electric charge. Comment on this. 
Answer: This should remind us of the theorem (which is commonly called “no hair theorem”)!!7?! 
that any black hole is totally characterized (from the perspective of an external observer) by only three 
independent (and intrinsic) physical parameters: mass M, angular momentum J and electric charge 


Q (see § 8.7). In this sense, the Kerr-Newman metric may be considered as the most general metric 
for black holes. 


Exercises 


1. Summarize the main characteristics of the Kerr-Newman solution. 

2. Compare the Schwarzschild metric to the following three metrics: Kerr, Reissner-Nordstrom, and 
Kerr-Newman. 

3. Referring to black holes, what is the main physical distinction of the Kerr and Kerr-Newman metrics 
from the Schwarzschild metric? 


[171] We note again the requirement of axial (rather than spherical) symmetry and the exclusion of “rotating” from “static”. 
[172] Hair refers to the detailed physical properties of the object other than those three (i.e. mass, angular momentum and 
electric charge). 


Chapter 5 
Classical Limit of General Relativity 


According to the correspondence principle, the formalism of general relativity should converge to the 
Newtonian formalism in the classical limit (i.e. time independent weak gravitational fields sourced by 
matter only with low-speed gravitated object). This is a necessary (but not sufficient) requirement for the 
acceptance of general relativity because no correct theory should contradict the predictions of classical 
physics in its domain of validity. In the following sections we give some examples of this convergence 
where we will see that the classical formulae approximate the corresponding general relativistic formulae 
in the domain of validity of classical gravity. 


5.1 Convergence to Newtonian Gravity 


In this section we show that the Field Equation converges to the Newtonian gravity, as represented by 

the Poisson equation (i.e. Eq. 14), in the classical limit. The required assumptions for the validity of this 

classical limit are: 

1. The source of gravity includes matter but not energy such as electromagnetic radiation. This is justified 
by the fact that classical gravity is an attribute of mass in its classical sense and hence energy is not 
a source of gravity. 

2. The gravitational field is weak (see § 1.3 and exercises 18 and 19 of § 4.1.1 for the definition of “weak”). 
This is because in strong gravitational fields non-classical effects of gravity become tangible. 

3. The gravitational field is time independent. This can be justified by the fact that classical gravity is 
inherently a spatial phenomenon with no inbuilt time dependency (see exercise 2).!!731 

4. Any motion of the gravitated object is very slow relative to the speed of light c and hence the temporal 
component of its velocity 4-vector is much larger than the spatial components, i.e. ae > de C= 
1,2,3).U741 This is because at high speeds (i.e. comparable to c) peculiar special and general relativistic 
effects take place and this invalidates the classical limit treatment. 


Now, since the gravitational field is weak the space is approximately flat Minkowskian and hence the 
metric tensor g,,, can be approximated by: 


Guv ~ Nv + Epv (198) 


where 7, is the Minkowski metric tensor (i.e. diag [1, —1, —1, —1]) while ¢,,, is a perturbation tensor that 
represents first order perturbations to the components of the Minkowski metric such that |e,.| <1. 

The above assumptions for the validity of the classical limit means that the only non-zero component 
of the energy-momentum tensor T),, is Tp) = pc? (see § 2.14) because this component represents scaled 
mass density (which does exist in this classical setting) while all the other components vanish because 
they do not represent a classical source of gravity and hence they do not exist according to the given 
assumptions. In fact, this justification originates from a classical perspective. From a general relativistic 
perspective the component Too is much larger than any other component of T;,, in the classical limit and 
hence the other components are negligible in their overall contribution to gravity.!!75] Anyway, let ignore 


1173] Regardless of this justification, this can be seen as a convenient case for demonstrating the convergence. 

[174] This condition should become more obvious if we note that the temporal component of the velocity 4-vector is yc while 
the spatial components are yu’. 

275] Tn fact, we may assume an energy-momentum tensor of a homogeneous cloud of dust (possibly in a co-moving frame; 
see problems 3 and 4 of § 2.14) which seems appropriate in this classical setting (although it may weaken the generality 
of our argument). Moreover, since we are dealing with classical limit (and possibly in a co-moving frame) then p and 
po are essentially the same (and we use p for simplicity). 


122 
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these trivial and messy details (some of which may be questionable) and try to obtain the classical limit 
from the 00 component alone since the field formulation of classical gravity (ie. Poisson equation) has 
only one component which should correspond to the 00 component of the general relativistic formulation. 
Accordingly: 


1 
Rw = (Tw — 507) (199) 
1 
Roo = « (Tw — 5007 ) (200) 
2 1 2 
Roo ~& aG = 5 no + 20] oc) (201) 
2 1 2 
Roo = « (oe — 5 ll + €00] pe) (202) 
gto 1g 
Roo = | pcr — Pe — aPC E00 (203) 
DP cgeod Ss 
Roo = & g Po — BPC E00 (204) 
1 
Roo & 5 hee (205) 


where in line 1 we use the tensorial Field Equation in one of its forms (see exercise 5 of § 3.2), in line 2 we 
take just the 00 component of the tensorial equation, in line 3 we use Tp) = T' = pc? and goo © Noo + €00 
(see Eq. 198), in line 4 we use 799 = 1, and in line 7 we use |eo9| < 1. 

Similarly, from the geometric definition of Rog we have (see Eq. 89): 


Roo = O13, — Ool'Sy + Foal Zo —Pbo0 Ga Y Jal Go (206) 


This approximation is justified by the fact that 0915, = 0 due to the presumed time independence while 
yah and Pres are negligible in our first order approximation since they are of higher order in €,, 
(refer to § 3.4). Now, ['§, is given by (see Eq. 52): 
P ye 
Poo = > (90908 + A908 — Asgoo) 
B 
g* 
= -d 
5) 8900 
ia 
—5 96 (m00 + E00) 


I 


I 


Ge 
TRE (1 + E00) 


ap 
= ~*y Oem 


1 
ocd — 52" €00 (207) 


where line 2 is because 0o90g = 0 due to time independence, in line 3 we substitute for goo using Eq. 198, 
in line 4 we use 79 = 1, and in line 6 we raise the index.!!7°| On substituting from Eq. 207 into Eq. 206 
we obtain: 


Roo SS —Oy (-32°<00) 


[176] We note that g°® may be replaced by 7%? for index raising (see Eq. 111) as we did in § 3.4. In fact, this could be 
needed in justifying some of the subsequent steps (see footnote [144] ). 
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1 
= 70a" €00 (208) 
l_» 
= 2 E00 (209) 
1 
= —5V "E00 (210) 
where in line 3 we use the definition of the d’Alembertian operator, and in line 4 we use time independence 
(noting that 0? = oe — V?; see exercise 15 of § 2.2). On comparing Eq. 210 with Eq. 205 we get: 


V7E09 = —Kpc? (211) 


Now, from the geodesic equation, using the proper time parameter 7 as an affine parameter (see problem 
1 of § 2.9.5), we have: 


ea Bes =0 (212) 


The term F¢, 4#" 427 is a sum of terms all of which are negligible according to our approximation except 


aoe By dr dr iain 
00 ae ae The reason for the dominance of I$) te a 


condition > ae which makes all the terms involving one or two spatial components of the velocity 
4-vector negligible in comparison to the term that involves the square of the temporal component, i.e. 


over all other terms combined is the aforementioned 


boc Ld Accordingly, the geodesic equation can be approximated by: 
Boe se th (@) (213) 
_ ~ ( 50°w ei (214) 
we = aa) 7 a9 
oe ow 
oo ~ = oem (217) 


where in line 2 we substitute from Eq. 207, in line 4 we use x° = ct, and in line 5 we use the fact that in 
the classical limit we have dr ~ dt.!'771 Now, if we take the spatial components only (which are the only 
components that exist classically or at least they are the only ones that we need here) we get: 


d?x* cage . 
We F 0"€00 (¢ = 1,2,3) 
2 
c a 
ad 2 [—Veoo] 
Ce 4 
a ig, [Veoo] (218) 


where in line 2 we use the definition of the contravariant partial differential operator 6° (see exercise 15 
of 2.2). 


1/2 
(1771 In the classical limit the gravitational field is weak and hence we have (1 oy 2gM ) ~ 1 and therefore from Eq. 243 


we get dr ~ dt (see § 6.3.1). Also, any special relativistic effects are negligible due to the low-speed limit and hence 
+ ~ land dr ~ dt (see B4). We may also multiply both sides of line 4 with (dr/dt)? and use the chain rule noting that 


a 2 @ 
ane (4) — = (see problem 1 of § 2.9.5) although this should require t to be an affine parameter (which may be 


reasonable in the classical limit). Also see exercise 7. 
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Referring to Eq. 12, which is the classical formulation of the gravitational field noting that ig] = 
oe Ua we have: 
a’ 
dt? 
So, if we equate the right hand side of Eq. 218 (which is the approximate general relativistic formulation as 
derived from the geodesic equation) to the right hand side of Eq. 219 (which is the classical formulation) 
we get: 


= — [V4] (219) 


Cc 
VG = > VEoo (220) 


On taking the divergence of the two sides of the last equation we obtain: 


V-V® = Pied - VEoo (221) 
Vd = Cv Eo0 (222) 
V7b = S (—Kpc’) (223) 
V’7> = — Srp! (224) 


where in line 2 we use the definition of the Laplacian operator as the divergence of gradient, and in line 
3 we substitute from Eq. 211. 
Now, the Poisson equation of classical gravity is given by (refer to Eq. 14): 


V° = 4nGp (225) 


So, if « in the general relativistic part of Eq. 224 is defined as k = — sxe then Eq. 224 becomes: 


1 81G 

2 a 

Ve= = (-=*) pe =4nGp (226) 
which is identical to the Poisson equation of classical gravity. We may also reverse this argument by 
saying: if the general relativistic formulation should converge to the classical limit then by comparing Eq. 
224 to Eq. 225 we get: 


— Spe! = AnGp (227) 
87G 


This should justify what have been stated in § 3.2 about the mathematical expression of «. In brief, 
the general relativistic formulation (which is mainly based on the Field Equation as explained above) 
converges to the classical gravity formulation (which is represented by the Poisson equation as explained 
in § 1.5) with « of the general relativistic formulation being defined by Eq. 228. 

Finally, it is time to show that C, = =o = Rg which we stated in the final stage of the derivation of 
the definite form of the Schwarzschild metric in § 4.1.1 without sufficient justification.!!”9] On integrating 


the two sides of Eq. 220 we get: 


2 
®= %H £00 +A (229) 


[178] In fact, we should need the equivalence principle (in conjunction with Newton’s second law) to justify this. However, 
we should need no more than the classical one and hence there should be no problem. 

1179] Ty fact, this requires the assumption of the applicability of the classical limit approximation to the Schwarzschild 
solution. Although some of the classical limit assumptions are consistent with the Schwarzschild assumptions (e.g. time 
independence) some may not (e.g. weak gravitational field) unless we restrict the validity of the Schwarzschild solution 
which we should not. So, the validity of C,; = 2G in general may require further assumptions and extensions. 
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where A is the constant of integration (noting that gravity is a central field). Now, ® is given classically 
by & = —&™ (see exercise 2 of § 1.5) and hence at infinity (ic. r + co) we have ® = 0. Moreover, at 
infinity the spacetime is flat and hence ¢g99 = 0. Therefore, we should have A = 0, that is: 


od = 00 (230) 
GM oa 
= = = 231 
r 2 an (231) 
2GM 
€00 = — 5) (232) 
Cr 
So, from Eq. 198 we get: 
2GM 
goo ~ Noo + €00 = 1 — ap (233) 
Noting that in § 4.1.1 we obtained (see Eqs. 122 and 147): 
Cs 
goo =e =1- ae (234) 


we can see (by comparing the last two equations) that C, = ood = Rg as claimed in § 4.1.1. 

Problems 

1. Summarize the essence of this section. 
Answer: Assuming classical limit conditions (i.e. weak, time independent gravitational field sourced 
by matter only with low speed): 

a) From the Field Equation we obtained Roo ~ $Kpc?. 


b) From the geometric definition of Ro9 we obtained Roo ~ —$V7 <0. 
c) On comparing the equations in a and b we optamed V7e00 = = —Kpc?. 
d) From the geodesic equation we obtained 4 oe = 5 [Veoo]' ; 

e) From the classical formulation we have oe =— ial ‘ 


f) On comparing the equations in d and e we obtained V® = | Ve00- 
Cc 


g) On taking the divergence of the equation in f we obtained V?® = $V7eo0. 

) On substituting from the equation in c (which is a aia equatio cad in the equation in g (which 
is a classic-relativistic equation) we obtained V?® = —tKpct : 
(i) Hence, if « in the general relativistic part of the equation in h is defined as K = a then the 
equation in h becomes V?® = 4G which is the classical Poisson equation. In other words, in the 
classical limit (associated with an appropriate definition of «) the classical and general relativistic 
formulations agree. 


( 
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Exercises 


1. What are the main factors that determine the intensity (or strength) of the gravitational field? 

2. Considering that classical gravity is basically a spatial phenomenon with no time dependency, can we 
have time dependency in classical gravity? If so, what is the difference between time dependency in 
classical gravity and in general relativity? 

3. The condition of time independence may be stated by some as slow variation in time. Comment on 
this. 

4. Does the condition of slow speed relative to c contradict the time independence of the gravitational 
field? 

5. Outline, roughly and in general terms, the situations where general relativity can be replaced by 
classical gravity. 

6. Use the approximation goo ~ moo + €00 = 1 — 2GM which is given in the text to propose a formal 
condition for the validity of classical gravity. Also interpret this condition. 

7. Justify the claim that in the classical limit we have dr ~ dt considering both the special and general 
relativistic effects. 
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8. Obtain (using a compact method) the classical equation of gravitational field from the geodesic equation 
of general relativity in the classical limit. 


5.2 Planetary Motion 


In this section we obtain the Newtonian formulation of planetary motion as a classical limit for the 
general relativistic formulation. In other words, we will show that in the classical limit the general 
relativistic formulation converges to the Newtonian formulation as given in § 1.5.1 by Eqs. 15-16 and 
hence the planetary motion is described by Newton’s laws (and indeed by Kepler’s laws which are direct 
consequences of Newton’s laws). In fact, the results that we obtained in § 5.1 about the convergence of 
the Field Equation to the Newtonian gravity in the classical limit should be sufficient to establish the 
claim that we can obtain the Newtonian formulation of planetary motion as a classical limit for general 
relativity.!48°l However, the content in the present section should establish this fact more vividly and 
directly. Moreover, it can serve as a double check for the previous results. We should also note that 
planetary motion here is more general than the motion of planets in the solar system. In fact, it applies 
to all sorts of motion that meet the given criteria and hence we may call it orbital motion for the sake of 
generality. However, we prefer “planetary motion” due to its correspondence to the historical investigation 
of planetary motion in classical physics (see § 1.5.1). Moreover, orbital motion is too general since it 
extends to orbiting systems that do not meet the conditions of classical limit (e.g. some binary orbiting 
systems of compact objects with very high orbital speed). So, “planetary motion” here can be seen as a 
typical example of orbital motion of classical orbiting systems. 

It should be obvious that what is required here is to show that in the classical limit the tensorial 
geodesic equation of general relativity converges to the Newtonian formulation for planetary motion since 
the geodesic equation describes the trajectory of gravitated free objects in spacetime. Now, the geodesic 
equation (see Eq. 73) requires the knowledge of the Christoffel symbols which depend on the metric of 
spacetime (see Eq. 52). So, what metric should we use to describe the geometry of the spacetime of 
the planetary motion in the classical limit? It should be obvious that the appropriate metric for the 
spacetime in this case is the Schwarzschild metric (because Schwarzschild object is a good prototype for 
the gravitating objects in orbiting systems).|184) 

In fact, we have already obtained the Christoffel symbols of the Schwarzschild metric in § 4.1.1. More- 
over, we have obtained in § 4.1.2 the system of geodesic equations of the Schwarzschild spacetime (i.e. the 
system of Eqs. 154-157) and simplified this system to the system of Eqs. 161-163 for the case of massive 
objects which is what we need here since planets are massive objects. So, all we need for demonstrating 
that the general relativistic formulation converges to the Newtonian formulation in the classical limit is 
to show that the general relativistic system of Eqs. 161-163 reduces to the Newtonian system of Eqs. 
15-16 in this limit and hence all the classical formulations (e.g. Kepler’s laws) that we obtained in § 
1.5.1 from Eqs. 15-16 are obtainable from the general relativistic formulation in the classical limit. This 
demonstration is given in the Problems. 


Problems 


1. Show that in the classical limit the system of Eqs. 161-163 will reduce to the classical system of Eqs. 
15-16 (which means that in the classical limit the general relativistic formulation for planetary motion 
reduces to the corresponding classical formulation). 

Answer: First, we discard Eq. 161 by noting that in the classical limit we have a ~ 1 and hence Eq. 
161 is reduced to F' ~ A which is just an approximation for F’. Now, since F and A have no presence 
in the other two equations then Eq. 161 is redundant in the classical limit and hence the system of 


[180] In fact, the results of § 5.1 are sufficient from a gravitational perspective. So, if we note that classical planetary motion 
requires other classical results (e.g. Newton’s second law) then we may still need (partially at least) this section to 
establish the above claim. 

[181] We should note that some restrictive conditions in the Schwarzschild metric (such as spherical symmetry and non- 
rotation of the gravitating object) should apply in the considered orbiting systems. These conditions are generally valid 
(at least approximately) in the common classical orbiting systems. 
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Eqs. 161-163 reduces to two equations only, that is: 


du GM 3GMu? 
do 2 
wit war pRB 2 
ey (236) 
Next, we reduce Eq. 236 to Eq. 16 using the same argument, i.e. in the classical limit we have dr ~ dt 
and hence ae in Eq. 236 becomes ae Accordingly, the system of Eqs. 235-236 becomes: 
du GM 3GMu? 
dee +u = Be st 2 (237) 
do 2 
ai u (238) 


Finally, we need to reduce Eq. 237 to Eq. 15. To do this we should discard the term 23¢44 uw by showing 


C2 


that in the classical limit this term is negligible in comparison to the term ot . The ratio of sea 


GM 
B2 


to is: 


3G Mu? ° B? 3u? 
C2 GM Cc 


_ wa (2) 
c? ut \ dt 
sje ei 
cu? \ dt 
Soe ey) 
c \ dt 
where in line 2 we use Eq. 238 and in line 4 we use u = 1/r. Now, if we take the Earth as a typical 


planet then the average distance from the Sun is r ~ 1.4960 x 10!! m and the angular speed around 
2 
the Sun is a ~ 1.9924 x 107-7 rad/s, and hence the ratio of 3c to GM is: 


2 
x (1.9924 x 10-7)” ~ 2.9614 x 10-8 


3r? (2) 3 x (1.4960 x 101! 
: ~ 


2 \ dt (3 x 108)? 


ie. in the classical limit the term GN u> ig very small compared to the term ut and hence the term 
oGMnE can be discarded and Eq. 237 becomes: 

iu n GM 

pee py eae 

dd? B2 


Accordingly, the system of Eqs. 237-238 becomes: 


dtu GM 
do 2 
OP 240 
a u (240) 


As we see, the system of Eqs. 239-240 (which is obtained from the geodesic equation of general 
relativity as a classical limit) is identical to the system of Eqs. 15-16 (which is obtained from Newton’s 
laws). This means that in the classical limit the general relativistic formulation for planetary motion 
reduces to the corresponding classical formulation, as claimed. 
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Note 1: it may be claimed that the validity of the above argument should require an extra condition 
that is the B in the classical formulation is equal to the B in the general relativistic formulation. 
However, we think this is a trivial requirement that can be obtained directly (if needed) from the 
similarity of the above two formulations in the classical limit plus the correspondence principle. In 
other words, the two formulations are identical provided that B in the two formulations is the same 
(similar to what we did in § 5.1 with «). In fact, Eq. 240 (which is relativistic as well as classic) 
identifies B as the magnitude of angular momentum per unit mass or twice the areal speed (see § 
1.5.1) and hence B should represent the same physical quantity in both formulations. 

Note 2: there is an important difference between the classical and general relativistic formulations 
(apart from the term sever), This difference is related to the meaning of the coordinates (specifically 
t and r = 1/u) since in general relativity they are just coordinate variables while in classical physics 
they are physical variables (see § 6.2). However, in the classical limit the gap between the two should 
be bridged because the spacetime in this limit is essentially flat and hence the coordinate variables 
reduce to the physical variables (in fact this has already been considered in a sense in the above 
formulation with regard to t where we used the approximation dt ~ dt since t corresponds to proper 
time 7 according to this approximation). 

f saa to GM 


Sa :« Can 


Note 3: if we note that rie = v (with v representing orbital speed) then the ratio o or 


2 
a 3r? (de\" _ 30 
be expressed as 3 ( a) = 


In fact, this demonstrates the tininess of this ratio in a more general way than the example of the 
Earth (which is given for pedagogical purposes and to show this fact more realistically and vividly). 


which in the classical limit (where v < c) should be very small.!!81 


Exercises 


1. Give some examples of necessary assumptions that should be made about the gravitating object for 
the validity of employing the classical limit in the planetary motion. 

2. Give some examples of necessary assumptions that should be made about the gravitated object for the 
validity of employing the classical limit in the planetary motion. 


[182] For example, the orbital speed of the Earth is v ~ 29.8 km/s and hence eee ~ 2.96 x 10~8 which is similar to the above 
calculation. 


Chapter 6 
Frames, Coordinates and Spacetime 


In this chapter we investigate frames and coordinates in general relativity and their relation to spacetime. 
In fact, the materials in this chapter should have been investigated in an early chapter. However, the 
investigation is delayed to this position in the book because part of the materials requires having specific 
spacetime metric which was the subject of our investigation in § 4. As we will see, we will use here the 
Schwarzschild metric (in the Schwarzschild coordinates) that we investigated earlier in § 4.1. The choice 
of the Schwarzschild metric is partly justified by being the most typical and widely used metric in the 
applications of general relativity as well as its favorable properties such as simplicity. However, we would 
like to emphasize that many of the results that we obtain in this chapter depend on the validity of the 
Schwarzschild metric and hence they may not be valid if the spacetime is metricized by another metric. In 
other words, the results are “Schwarzschild-based general relativistic results” and not “general relativistic 
results” unconditionally.!!83! 


Exercises 


1. Discuss the consequences of the dependency of the results that we obtain in this chapter on the 
Schwarzschild metric. 


6.1 Frames in General Relativity 


In this section we investigate the different types of frame in general relativity and compare them to the 
frames in classical mechanics and in special relativity (or what we call Lorentz mechanics). As we will 
see, several types of frame are needed in general relativity for analyzing various physical situations from 
different perspectives and drawing the required physical consequences. 

It is obvious that the frames in classical mechanics and in special relativity are global as they cover 
the entire spacetime because the spacetime is flat and uniform in its metric properties and hence there is 
no necessity or advantage in using local frames (since this requires using numerous local frames to cover 
the entire spacetime) when a single global frame that covers the entire spacetime is sufficient to fulfill 
the intended objectives and achieve the purpose of reference frame. However, the situation in general 
relativity is very different because the spacetime is globally curved and locally flat and we are supposed 
to apply the rules of special relativity at local level. So, it is more sensible and advantageous to use local 
frames to coordinate local patches of spacetime and develop the required physics as dictated by the plan 
of general relativity. 

Accordingly, in the development of the concepts, arguments and formalism of general relativity (as 
well as its application) we may consider (motivated mainly by pragmatic purposes as well as physical 
requirements) three main types of local frame: 

(a) Frame at infinity (ie. very far away from any source of gravitation). Since this frame does exist 
in a flat part of the spacetime then the rules of special relativity should apply in the frame due to the 
actual absence of gravity. Whether this frame is moving uniformly or at rest (considering the relativity 
of these attributes although in a sense they are absolute relative to the source of gravity) is irrelevant 
in this context because the frame is subject to the rules of special relativity regardless of its kinematical 
state. However, the frame should not be accelerating (as indicated by “uniformly”) because otherwise it 
will be like a gravitational frame due to the presumed equivalence between gravity and acceleration (or 


[183] We should note that some of the general relativistic results related to the Schwarzschild metric may even depend on 
the Schwarzschild coordinates specifically. In fact, this is a general feature in general relativity and is not specific to 
the Schwarzschild metric and coordinates (see § 10.1.7). This should add more limitation to general relativistic results 
in general. 
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alternatively because special relativity is restricted to inertial frames). Accordingly, when we talk in this 
book about a frame (or observer) at infinity (or far from the source of gravity) we generally assume it to 
be inertial unless we indicate otherwise.!!*41 

(b) Freely falling frame in a gravitational field. Although this frame does exist in a curved part of the 
spacetime the rules of special relativity should also apply in the frame locally. The reason is that the 
gravity is effectively absent due to its neutralization by the acceleration of free fall according to the 
equivalence principle. In fact, we may say: the weak equivalence principle establishes the status of the 
frame as inertial by canceling the effect of gravity while the strong equivalence principle ensures the local 
application of special relativity. 

(c) Frame in a gravitational field but it is not in a state of free fall, i.e. the frame has a propelling 
mechanism that prevents free fall under the influence of gravity and hence it is moving non-uniformly 
relative to a freely falling frame in its location. This frame should either be at rest in the spacetime 
relative to the source of gravitation or be moving in a non free fall fashion. In the first case (i.e. at rest) 
the frame is equivalent to an accelerating frame while in the second case the frame is either equivalent 
to an accelerating frame or it is actually an accelerating frame.!!85! In both cases, the rules of special 
relativity should not apply because the frame essentially is neither inertial nor equivalent to inertial. More 
details about this type of frame is given in the Problems. 

It is noteworthy that in the above classification of frames we are assuming that the frames are non- 
rotating; otherwise the frame of any one of the above three types will be accelerating and hence it will 
not be subject to the rules of special relativity (also see § 6.5). As we will see, this should lead to the 
conclusion of the necessity of the existence of an absolute frame in the background of the spacetime 
of general relativity (similar to what we found in B4 about this necessity in the spacetime of special 
relativity) and this conclusion has many physical and epistemological implications and consequences that 
will be investigated in the future. 


Problems 


1. Provide more details about the local frames of type c. 
Answer: There are different views in the literature about this messy issue. In the following points we 
provide some details: 
e Some authors seem to suggest that the frames of this type are also subject to the rules of special 
relativity (apparently with no condition). But this is obviously wrong (if the application is uncondi- 
tional like the application in the flat spacetime of special relativity) because this effectively nullifies 
the effect of gravity. 
e Some authors suggest that special relativity should apply if we assume the existence of fictitious 
forces that influence the objects in its surrounding in such a manner that makes the frame effectively 
inertial. However, for this to apply we may need to assume that such an assumption is physically 
viable and there is no absolute frame and hence everything is relative. 
e It is also suggested that since a stationary gravitational framel!®*! is equivalent to an accelerating 
frame (according to the equivalence principle) then the rules of special relativity should apply to the 
“equivalent instantaneous rest frames” which are inertial frames by definition (see B4). So, by consid- 
ering the instantaneous rest frames (which are inertial), special relativity should apply in this series 
of frames “event by event”. However, the validity and legitimacy of this interpretation requires direct 
experimental evidence (as well as proper explanation and application). In fact, the equivalence of the 
observations in an accelerating frame and in its instantaneous inertial rest frame can be challenged 
and refuted at least in some cases (e.g. the accelerometers in the two frames should have different 
readings) and hence the validity of this interpretation is debatable (at least in its generality). 
e It is also proposed!!87] that in a stationary gravitational frame we should first apply special relativity 


[184] Ty fact, in most cases we even assume it to be at rest to exclude special relativistic effects caused by motion. 

185] In both cases we are considering the acceleration relative to the frame of the source of gravitation and not relative to 
an absolute frame. Accordingly, the frame is equivalent to an accelerating frame if it is moving uniformly. 

“Stationary gravitational frame” here and in the next point should be a typical instance of the frames of type c and not 
the only instance of this type. 

[187] This proposal (which is inline with the Principle of General Covariance; see problem 3 of § 1.8.2) seems to be the most 


[186] 
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in the freely falling local frame (which is effectively inertial coordinating a flat patch of spacetime) 
and then we transform the special relativistic laws to the stationary frame by a general coordinate 
transformation (and hence what actually applies in the stationary frame is a “modified version” of the 
special relativistic laws in which the effect of gravity is incorporated). However, the validity of this 
depends on the validity of the strong equivalence principle (whose general validity is questionable; in 
fact observational evidence seems to be against it; see § 10.1.1 for example). It should also depend 
on the existence and availability of such a transformation (which is not obvious at least in general 
especially when we consider non-stationary frames of this type). 

Finally, we should note that there are many details about the status of the frames of type c and the 
potential challenges to the above views and interpretations. However, most of these details are not 
very useful and some may be indeterminate within the framework of the theory and hence the value 
of any investigation or analysis will be largely speculative. Therefore, we prefer not to go through 
these consuming details (although some of these issues will be investigated further rather briefly in the 
future; see for example § 10.1.1 and § 10.1.11). Also, some of the above views may differ from others 
in presentation and form rather than in essence and content (due for instance to the use of misleading 
or lax phrasing or different methods of presentation and expression). 


Exercises 


1. The frames in classical mechanics and in special relativity are usually global while the frames in general 
relativity are generally local. Why? 

2. Is it necessary that the frames in general relativity are local? Why? 

3. If there are frames in general relativity that are not subject to the rules of special relativity (i.e. 
gravitational frames that are not in a state of gravitational free fall) then how are these frames treated 
in general relativity? Can this be a source of defect or limitation in the theory of general relativity? 

4. What are the cases in which special relativity applies in the spacetime of general relativity? 

5. For the application of special relativity, what conditions we need to assume about the spacetime and 
the reference frame? Try to link this to the application of special relativity locally in freely falling 
frames in the spacetime of general relativity. 

6. Discuss the status of accelerating frame at infinity (i.e. very far away from any source of gravity) in 
the spacetime of general relativity. 

7. In the text we considered three main types of local frame in the spacetime of general relativity. Try 
to elaborate on this. 


6.2 Coordinates in General Relativity 


In this section we investigate the significance and interpretation of coordinates in general relativity. We 
will use in our investigation (partly in this section and mostly in the subsequent sections which continue 
this investigation) the Schwarzschild coordinates as an example and case study and hence we investigate 
their interpretation and highlight their significance. This should help us to appreciate the meaning of 
coordinates in general relativity in general despite the restricted nature of the investigation. 

The main issue regarding the meaning of coordinates in general relativity and their interpretation is 
related to the metrical significance of the coordinates of spacetime. In other words, when we use (ct, r, 8, ¢) 
in Schwarzschild spacetime for example do t¢ stand for the physical time as measured by a clock and r stand 
for the physical distance as measured by a stick? Or we should rather consider t and r as mere coordinate 
variables (or useful symbolic tools or labels or markers) for obtaining the physical time and distance 
through the fundamental relations in the spacetime which are embedded in the metric of the spacetime? 
The answer (according to general relativity) is that the coordinates are just symbolic tools to label and 
distinguish points (or events) in spacetime and hence they do not have direct metrical significance or 
physical value. Therefore, the physical variables such as time and length should be obtained from the 
metric of the spacetime through the established relations where through these relations (and only through 
these relations) the coordinates provide physical meaning and significance. For example, in the relation 


rational, acceptable and consistent with the framework of the relativity theories. 
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dr = (1- Gu ) M? at (which will be investigated thoroughly in § 6.3.1) we have coordinate variables r and 
dt and a physical variable dr and hence the function of this metrical relation is to convert the coordinate 
variables r and dt to a physical variable dr, i.e. we use r and dt (which are labels to specific point and 
interval in spacetime) in this metrical relation to obtain the physical variable dr. 

An obvious consequence of the interpretation of coordinates in general relativity is that we will have 
coordinate time versus physical time and have coordinate length versus physical length. Although this 
might seem sensible from a formal or mathematical perspective, it requires justification and physical 
rationalization as well as addressing some practical problems. This issue will be assessed further later on 
(in this section as well as elsewhere such as § 10.1.6). However, the simple justification from a general 
relativistic viewpoint is that the qualification of coordinates as representative of physical variables comes 
from our experiences and presumptions in flat spaces where coordinates have direct physical significance, 
but this is not the case in general relativity where the spacetime is curved and hence the coordinates do 
not necessarily represent physical quantities although they are related to the physical quantities through 
the metric. 

Regarding the issue of the Schwarzschild metric, we used the spatial variables (r,6,¢) to represent 
the spatial coordinates in the Schwarzschild spacetime. It should be noted that in principle the (r, @, ¢) 
coordinates in the Schwarzschild spacetime are not the same as the (r,6,¢) coordinates in the familiar 
spherical coordinate system of a Euclidean 3D space despite the strong similarity between the two. The 
reason is that the (r,6,¢) of the spherical system represent coordinate variables of a flat space (and 
hence they represent physical variables with metrical significance) while the (r,6,¢) of the Schwarzschild 
spacetime represent coordinate variables of a curved space (and hence they are useful labels that should be 
used in conjunction with the metrical relations to obtain the physical variables). Therefore, in principle 
the (r,6,¢) of the spherical system are not the same as the (r,0,¢) of the Schwarzschild spacetime. 
Yes, since the Schwarzschild spacetime is asymptotically flat at infinity then the two become identical 
(asymptotically) at infinity and this should provide the missing link between the two that justifies the 
claim that the spherical coordinates underlie (in a sense) the spatial part of the Schwarzschild spacetime. 
We will also see that the difference between the two sets of coordinates (i.e. Schwarzschild and spherical) 
actually occurs only in the radial variable (i.e. 7) and that is why we used “in principle” in the above 
statements. 

We should remark that although the distinction between non-metrical coordinates and metrical variables 
may be understandable from a pure theoretical perspective, in some practical situations (when solving 
real world problems; see for example § 8.3) this distinction is difficult to comply with and implement. 
The reason is that in some of the formulae that contain non-metrical coordinates we do not have values 
for these non-metrical coordinates and hence we find ourselves obliged to use metrical values instead. For 
example, when we find t or r (which are coordinate variables) in a formula that is supposed to be used 
to solve a real world problem we may not know the value of these coordinate variables and hence we will 
have no choice but to use the actual physical time and radial distance (i.e. the metrical variables) because 
in real life we have access only to physical quantities (e.g. the time of a physical event that we observe 
is a physical time and hence when we want to use the time of this event in a metrical relation to obtain 
another physical quantity that depends on coordinate time we have no choice but to use the physical time 
instead of the coordinate time that is required as an input to the metrical relation). This should cast a 
shadow over the practicality of this distinction and the feasibility of the general relativistic methodology. 

We believe this could be a demonstration of the unsuitability of using a geometric approach to describe 
physical phenomena. In other words, in a purely mathematical (or geometrical) theory we have no such a 
problem because we are dealing with entirely abstract spaces and objects. Similarly, in a purely physical 
theory we have no such a problem because we are dealing with entirely physical entities and hence no 
such inconsistency can arise. But when we try to correlate the abstract entities of a geometric theory 
to physical entities in the real world we face such a departure and conflict between the abstract and the 
physical. In fact, the root of this problem is in the use of curved spacetime in modeling physical gravity. 
As we will discuss later, we think gravity should be treated like any other physical phenomenon, i.e. it is a 
physical phenomenon that takes place in a flat spacetime rather than an attribute of a curved spacetime. 
In our view, this general relativistic paradigm of modeling gravity as a curvature of spacetime (according 
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to the principle of metric gravity; see § 1.8.4) rather than a physical phenomenon in an ordinary flat 
spacetime is the origin of several problems (apart from being non-intuitive and unnecessary). Of course, 
general relativists do not agree on the above criticism and they have certain approaches and strategies 
to overcome the aforementioned difficulties. However, we will meet in the future some examples related 
to this criticism and the defence of general relativists and we will leave the final judgment to the readers 
(also see § 10.1.6). 


Exercises 


1. What distinguishes general relativity from classical mechanics and special relativity and hence makes 
the coordinates in general relativity of symbolic rather than metrical or physical value? 

2. Can we fix the problems that potentially arise from the premise that the coordinates of spacetime in 
general relativity have no metrical or physical significance? 

3. Assess the coordinates of rectilinear and curvilinear systems of 3D flat Euclidean space, e.g. Cartesian, 
cylindrical and spherical coordinate systems. 

4. Use a specific metrical relation as an example to elucidate the issue that in real life we have access 
only to physical (rather than coordinate) variables. 

5. Address the problem that modeling gravity as a physical phenomenon in a flat spacetime (rather than 
a geometric attribute of a curved spacetime) may be impossible or non-physical if the actual physical 
spacetime is not flat. 


6.3. Time in Schwarzschild Spacetime 


To analyze the meaning and properties of time in general relativity we use the Schwarzschild spacetime as 
a typical general relativistic spacetime and use stationary!'8*! frames of type c (refer to § 6.1) located at 
different positions in the Schwarzschild spacetime. In the following subsections we investigate a number 
of situations and settings to highlight several important issues about time in general relativity. 


6.3.1 Relation between Coordinate Time and Proper Time 


Let have a stationary frame at a certain position in the Schwarzschild spacetime and consider events that 
vary in time only. Now, since the variation is in time only then the variation in the spatial coordinates is 
null, i.e. dr = dO = dé = 0. Therefore, the Schwarzschild quadratic form (see § 4.1.1) will reduce to:!18°! 


(do)? = (:- 6") (cdt)? (241) 
(dr)? = 0-7" ) (at)? (242) 
dr = (1- 26M” 243 
(2 0s 


where in line 2 we divide by c? and use the relation (dr)? = (do/c)?. The significance of Eq. 243 is that: 


(a) At finite r and for 2¢™ < 1 the proper time interval dr is shorter than the coordinate time interval 


dt by the factor (1 - zeny ® 
(b) At infinite r (i.e. r > oo) the proper time interval dr is equal to the coordinate time interval dt. 
In brief, in the Schwarzschild spacetime the proper time interval is shorter than the coordinate time 


interval but it converges to the coordinate time interval at infinity (see § 6.3.3). 


Problems 


[188] Stationary here means at rest spatially relative to the source of gravity. The term “stationary” may also be used in the 
literature to mean time independent (i.e. there is no variation with respect to time). 


1189] Th the following we assume 2GM < 1 which is a valid assumption in general. Apart from being a mathematical necessity 
(to avoid having complex value when 26M > 1 and singularity when 2GM = 1), this condition excludes what is on and 


inside the event horizon of a black hole noting that M is the mass contained inside r. 
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1. Justify point (a) and point (b) about the significance of Eq. 243. 


Answer: For finite r and 2¢“ < 1 we should have (1 — 2cunee <1. This justifies point (a). 


If r + oo then 25“ — 0 and hence (1 — 2gm)1/? — 1. This justifies point (b). 


cr 
Exercises 


1. What the condition 2GM < 1 means? 


2. What you note about the formula dr = (1 — 2g )1/? dt? 


Cc 


6.3.2 Interpretation of Coordinate Time 


If r > co in Eq. 243 then we have: 
AT xq = dt (244) 


where we use the subscript oo to indicate the proper time at infinity. The significance of Eq. 244 is that: 
(a) The coordinate time is equal to the proper time of a (stationary)!/°°! observer at infinity. In fact, this 
can be used as a definition for the Schwarzschild temporal coordinate t and hence we say: the coordinate 
time t is the proper time of a (stationary) observer at infinity. 

(b) Since the proper time of a (stationary) observer at infinity is the same for any frame in the Schwarzschild 
spacetime then the coordinate time (i.e. the Schwarzschild temporal coordinate t) is global and it is com- 
mon to all frames. We note that this point may sound trivial but we made it explicit because it will be 
needed later. We should also note that the priority of being global may be reversed and hence we may 
state that the Schwarzschild temporal coordinate is global and hence the proper time of a (stationary) 
observer at infinity is global according to Eq. 244 (see § 6.3.3). We may also consider these as two 
independent facts. 


Exercises 


1. Is it useful to consider events at infinity which cannot be reached? 


6.3.3. Gravitational Time Dilation 


Let have two stationary frames: O, at rj and Og at rg where ry < rg. From Eq. 243 we have: 
IG? 2GM\"? 
dt = (1 = ) dt and dtz = (1 - ) dt (245) 


and hence dt, < dt because r1 < rp (see Problems). The significance of this is that the proper time 
interval of a frame that is deeper in a gravitational well is shorter than the corresponding proper time 
interval of a frame that is higher in the gravitational well (where this comparison is calibrated by the 
coordinate time interval dt which is global). In other words, time runs slower closer to the source of 
gravity where the gravitational field is stronger. So, we have gravitational time dilation where a frame 
farther from the source of gravity (i.e. at larger 7) observes the time of a frame nearer to the source of 
gravity (ie. at smaller r) to run slower (or alternatively a frame nearer to the source of gravity observes 
the time of a frame farther from the source of gravity to run faster). We should note that dt in Eq. 245 is 
not subscripted with 1 and 2 because coordinate time is common to all frames since it is global according 
to point (b) in § 6.3.2. 


Problems 


1. Determine if the factor (1 _- —— ) changes (i.e. increases or decreases) in the same sense as 7 or in 


24M \1/25 
1— er ) : 


the opposite sense. What about the factor ( 


Answer: It is obvious that if r increases/decreases then 2 decreases /increases and hence (1 - ee ) 
increases/decreases. This means that the factor (1 — 2G ) changes in the same sense as 1, i.e. 


(1 - ou ) is an increasing function of r. This should also apply to (1 - a eo because taking 


[199] The condition “stationary” is added here to exclude special relativistic effects caused by motion. 
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the square root does not affect the sense of inequality, i.e. if a > b then /a > vb (with a and b being 
non-negative). 
This should justify the statement in the text that dr, < drz because rj < ro. 

Exercises 


1. The statement: “the proper time interval of a frame that is deeper in a gravitational well is shorter 
than the corresponding proper time interval of a frame that is higher in the gravitational well” may be 
criticized by saying: you are comparing the time in two different local frames which have no common 
standard of calibration and hence the comparison is baseless. Discuss this issue. 


6.3.4 Gravitational Frequency Shift 


An effect that is directly related to the gravitational time dilation is the gravitational frequency shift. 
It is obvious from the definition of frequency v as the reciprocal of the periodic time T (i.e. v = 1/T) 
that any physical phenomenon that is periodic (and hence it has periodic time) like electromagnetic 
waves should have a frequency where the two are linked by this reciprocal relation. So, if the periodic 
time was influenced by gravity in a certain manner (i.e. dilation or contraction) then the frequency 
should necessarily be influenced by gravity in the opposite manner (i.e. contraction or dilation).“9 In 
other words, if the periodic time is gravitationally dilated then the frequency should be gravitationally 
contracted and vice versa. Accordingly, a wave that is ascending in a gravitational well should be red 
shifted while a wave that is descending in a gravitational well should be blue shifted (see exercise 1). 
In quantitative terms, if we substitute from Eq. 244 into Eq. 243 then we have: 


1/2 
dr = pexens AT x0 (246) 
cr 
1/2 
ie tee We (247) 
cr 
1 2GM\'? 4 
= = (1- 2 ) (248) 
V Cr Voo 
29GM\1/? 
Voo = (1- ) v (249) 


where in line 2 we use the periodic time T at a given local stationary frame to represent the proper time 
interval dv in that frame (and similarly for T,, with d7v,,), and in line 3 we use the relation T = 1/v. 
Noting that v is the frequency of the wave in the frame at r, Eq. 249 is the quantitative representation of 
the aforementioned fact that a wave ascending in a gravitational well is red shifted while a wave descending 
in a gravitational well is blue shifted. This fact may be more clarified and generalized by considering the 
relation between the frequency in two stationary frames: O; at r, with frequency 1; and O2 at ro with 
frequency v2 where ry < rg. From Eq. 249 we have: 


29GM\\/? 2GM\\/? 
Voo = (1 a, ) Wy= (1 a ) V2 (250) 
Cory cre 
and hence: t/a as 
2GM 2GM \ — 
aed i (251) 
V2 Cre er 


1/2 1/2 
Now, since r1 < rg then (1 - 2g) < (1 - 2g) (see § 6.3.3) and hence the last equation implies 


that vg < 1 (ie. the frequency 1, at r; is red shifted to the frequency v2 at ra by ascending in the 


1191] The use of contraction and dilation with respect to frequency is for the purpose of imitation; otherwise it may be more 
appropriate to use decrease and increase instead. 
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gravitational well, or alternatively the frequency v2 at r2 is blue shifted to the frequency 1 at r, by 
descending in the gravitational well). 


Exercises 


1. Justify the following statement: “A wave that is ascending in a gravitational well should be red shifted 
while a wave that is descending in a gravitational well should be blue shifted”. 


6.3.5 Comparison with Classical Mechanics and Special Relativity 


In classical mechanics, the time is global, absolute and independent of any local metric property!!9?! and 
hence none of the general relativistic implications that we investigated in the previous subsections (e.g. 
gravitational time dilation) do exist!!®3] because all the general relativistic implications are based on the 
locality and relativity of time (ie. physical or metrical time) and its dependence on the local metric 
properties of the spacetime. 

Regarding special relativity, the spacetime of special relativity is flat coordinated by global frames (i.e. 

one global frame for each observer) where each frame is distinguished from all other frames by its relative 
uniform motion with respect to the other frames or with respect to absolute frame if we believe in absolute 
frame (noting that the frames in special relativity are inertial). As a consequence, we have the following: 
(a) In special relativity the time is global and absolute in each frame (although it is relative across frames) 
unlike the time in general relativity which is local (i.e. it varies from one location of spacetime to another) 
and relative (i.e. it depends on the local frame). Moreover, the time in special relativity is independent of 
the local metric properties of the spacetime because these metric properties are uniform and they equally 
apply to all parts of the spacetime. Yes, when we compare the time in two special relativistic frames then 
the time in special relativity is also local in this sense (since each frame has its own time) and relative. 
Moreover, it depends on the “local metric properties” of the spacetime in this sense. 
(b) In special relativity time dilation (as observed across two global frames in relative motion with respect 
to each other) is caused by motion and hence it is a function of speed (i.e. it is a kinematical effect) while 
in general relativity time dilation is caused by gravity and hence it is a function of location (or local metric 
properties of spacetime), i.e. it is a gravitational effect. Yes, there should also be non-gravitational time 
dilation in the spacetime of general relativity (in addition to the gravitational time dilation) if the frame is 
moving and not stationary. Supposedly, the determination of this non-gravitational time dilation should 
in principle follow the rules of special relativity (and hence this non-gravitational time dilation can be 
labeled as special relativistic time dilation). However, this depends on several factors and considerations 
(see for example § 6.1; also refer to B4) which are not a priority for our investigation here due to the 
general relativistic nature of this investigation. 


Exercises 


1. Compare the time of occurrence and the time of observation in special relativity and in general rela- 
tivity. 
2. Compare between time dilation in special relativity and time dilation in general relativity. 


6.4 Length in Schwarzschild Spacetime 


To analyze the meaning and properties of length (or spatial interval or distance) in general relativity let 
use the Schwarzschild spacetime again as a typical general relativistic spacetime and use stationary!!%41 
frames of type c (refer to § 6.1) located at different positions in the Schwarzschild spacetime. In the 
following subsections we investigate several situations and settings to highlight a number of important 
issues about length in general relativity. 


192] Ty fact, the paradigm of metric space does not exist in the theoretical framework of classical mechanics or at least it is 
irrelevant. 

193] In fact, they do not exist as effects of the locality and relativity of time although some may exist as results of other 
classically established principles like the conservation of energy which can classically explain gravitational frequency 
shift (at least in some cases and circumstances). 

194] Again, stationary here means at rest spatially relative to the source of gravity. 
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6.4.1 Relation between Spatial Coordinates and Proper Length 


Let have a stationary frame at a certain position in the Schwarzschild spacetime and consider events that 
vary in space only. Now, since the variation is in space only then the variation in the temporal coordinate 
is null, i.e. dt = 0. Therefore, the Schwarzschild quadratic form (see § 4.1.1) will reduce to: 


(do)* = 


-1 
; ) (dr)? — r? (d0)” — r? sin? 6 (dd)? = — (ds)? (252) 
Cr 

where ds is the length (ie. 3D spatial interval in its infinitesimal form or 3D line element) which can 
be correctly described as the proper length (since it is the length measured at the same time and hence 
it is “proper” like the “proper” time interval which is the time interval measured at the same spatial 
location).!!%! Now, since we are investigating length then let use (ds)” instead of (do)*, that is: 


Cr 


(ds)? = (1 ae ) (dr)? +r? (d0)? + r? sin? 0 (dd)? (253) 


As we see, Eq. 253 is the same as the quadratic form of spherical coordinate system of a Euclidean space 


apart from the factor (1 eee ae In brief, Eq. 253 implies that the quadratic form (ds)? of the proper 


cr 
2GM 
Or 


length ds is modified from its spherical form by the factor (1 _ has in its radial component where 
this component varies by this factor as r varies (see exercise 1). 

However, this is a gross analysis because length as formulated by Eq. 253 is rather complex to analyze in 
one go due to the involvement of several variables. So, let analyze length in the Schwarzschild spacetime 
by using Eq. 253 but with some simplification by exploiting the fact that length in spherical coordinates 
(which in a sense underlie the Schwarzschild coordinates) can be resolved into a circumferentiall!% 
component (where r is held constant and hence dr = 0) and a radial component (where 6 and ¢ are held 
constant and hence d@ = dé = 0). In other words, the length is resolved into a component embedded in 
the r coordinate surface (which is the circumferential component) and a component along the r coordinate 
curve (which is the radial component). So, our analysis of length in the Schwarzschild spacetime will be 
simplified in the following discussion by analyzing these two components separately. 

Considering the circumferential component, we have dr = 0 and hence Eq. 253 becomes: 


(ds)? = r? (d0)* + r? sin? 6 (dd)? (254) 


which is the same as the quadratic form in spherical coordinates on an r coordinate surface (ie. on a 
sphere of constant r). This means that in the Schwarzschild spacetime there is no direct effect of gravity 
on the circumferential component of length and hence the r coordinate surfaces in the Schwarzschild 
spacetime are similar to their counterparts in a spherical coordinate system (see exercise 2). 

Considering the radial component, we have d? = df = 0 and hence Eq. 253 becomes: 


(ds)? = (: = cm ) (dr)? (255) 
fie 2 (1 7 ou ye a (256) 


which is the same as the line element in spherical coordinates along an r coordinate curve (i.e. the 


intersection of @ and ¢ coordinate surfaces) but with modification by the factor (1 - aoe ee The 
significance of Eq. 256 is that: 
(a) At finite r and for 25“ < 1 the proper length ds (in its pure radial form) is longer than the coordinate 


ae 


radial length dr by the factor (1— 2¢¢@)~1/?, 


or 


1195] We note that in this general relativistic context “proper” may be used rather differently from “proper” in its special 
relativistic sense. 
[196] The circumferential component may be labeled as azimuthal or transversal (although these might be less general). 


6.4.2 Interpretation of Spatial Coordinates 139 


(b) At infinite r (i.e. r — oo) the proper length ds (in its pure radial form) is equal to the coordinate 
radial length dr. 


Problems 
1. Justify point (a) and point (b) about the significance of Eq. 256. 
Answer: For finite r and 2G“ < 1 we should have (1 _ 2GM)~¥/? > 1. This is because (1 _ 2041) <1 


cr cr cr 
and hence (1 - aol Ve < 1 and therefore (1 - aol iw > 1 according to the rules of inequalities. 
This justifies point (a). 


2GM 
cr 


— 0 and hence (1 — 2GM)~1/ — 1. This justifies point (b). 


cr 


If r > o then 


Exercises 


: 5 -1 F : : : 
1. Determine if the factor (1 — 204 ) changes (i.e. increases or decreases) in the same sense as r or in 


the opposite sense (assuming 2G“ < 1). What about the factor (1 — 2G“ yn 
2. Comment on our statement: the r coordinate surfaces in the Schwarzschild spacetime are similar to 


their counterparts in a spherical coordinate system. 


6.4.2 Interpretation of Spatial Coordinates 


For very large r (i.e. r + 00) we have (1 — er ke = 1 and hence Eq. 253 becomes: 
(ds)” = (dr)* +r? (dO)? +r? sin? 6 (dg)” (257) 


which is no more than the quadratic form of spherical coordinate system of 3D Euclidean space. This 
should provide an interpretation for the spatial coordinates of Schwarzschild spacetime. In fact, this shows 
that the spatial Schwarzschild coordinates converge to the spherical coordinates at infinity. 

To clarify the situation further, we consider the radial component alone (as represented by Eq. 256) in 
our interpretation of the spatial coordinates since the circumferential component is not affected by gravity 
as we found earlier in § 6.4.1 (and also because of what we observed here about the significance of 6 and 
@). Accordingly, if r + co in Eq. 256 then we have: 


890 = dr (258) 


where we use the subscript oo to indicate the proper length at infinity. The significance of Eq. 258 is 
that: 

(a) The coordinate radial length dr is equal to the proper length ds (in its pure radial form) of a (stationary) 
observer at infinity. In fact, this can be used as a definition for the Schwarzschild radial coordinate r and 
hence we say: the radial coordinate r is the measure of proper length (in its pure radial form) of a 
(stationary) observer at infinity. 

(b) Since the proper length ds of a (stationary) observer at infinity is the same for any frame in the 
Schwarzschild spacetime then the radial coordinate r in the Schwarzschild spacetime is global and it is 
common to all frames. This may be reversed and hence we infer from the presumed global nature of the 
radial coordinate r the global nature of the proper length at infinity. We may also consider these as two 
independent facts. 


Exercises 


1. Use the spatial quadratic form of the Schwarzschild metric to interpret the spatial Schwarzschild 
coordinates. 

2. The Schwarzschild metric converges to the flat Lorentz metric at infinity. What is the significance of 
this on the meaning and interpretation of the Schwarzschild coordinates? 


6.4.3 Gravitational Length Contraction 


Let have two stationary frames: O; at r; and O2 at rg where r; < rg, and let consider the radial component 
alone in our analysis since the circumferential component is not affected (at least directly) by gravity as 
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we found earlier. From Eq. 256 we have: 
29GM\ 71/2 9GM\ 71/2 
ds, = (1 ee ) dr and ds_ = (1 28 ) dr (259) 


and hence ds; > dsz because r; < rg (see Problems). We note that dr is not subscripted with 1 and 2 
because the radial coordinate r (which the differential dr is based on) is common to all frames since it is 
global according to point (b) in § 6.4.2. The significance of Eq. 259 is that the proper length in a frame 
that is deeper in a gravitational well is longer than the corresponding proper length in a frame that is 
higher in the gravitational well. In other words, length is contracted farther from the source of gravity 
where the gravitational field is weaker. So, we have gravitational length contraction in the radial direction 
where a frame nearer to the source of gravity (i.e. at smaller r) observes the length in a frame farther 
from the source of gravity (ie. at larger r) to be contracted (or alternatively a frame farther from the 
source of gravity observes the length in a frame nearer to the source of gravity to be dilated). 

We should finally remark that gravitational length contraction may not be seen in the literature of 
general relativity to be as natural and authentic as gravitational time dilation (in fact some authors 
describe it as apparent). This may be justified by some by the non-metrical (or non-physical) significance 
of coordinates in general relativity.!!97l However, this justification can be refuted by its applicability even 
to gravitational time dilation. In fact, the supposed uniformity of spacetime in all its dimensions (whether 
temporal or spatial) dictates that any temporal effect (like time dilation) should have a corresponding 
spatial effect (like length contraction) and vice versa and hence any consistent physical theory about 
spacetime (as general relativity is supposed to be) should guarantee the equivalence between temporal 
and spatial effects and consequences. So, gravitational length contraction should be as natural and 
genuine as gravitational time dilation (at least according to the Schwarzschild metric whose implications 
are supposedly supported experimentally in certain physical situations) and hence if gravitational time 
dilation is natural and genuine then gravitational length contraction should also be natural and genuine 
while if gravitational length contraction is not natural and genuine then gravitational time dilation should 
also be not natural and genuine. In fact, if a spacetime theory does not meet this condition (i.e. equivalence 
and correspondence between temporal and spatial effects) then its consistency and soundness should be 
questioned because it discriminates between the dimensions of the presumably-uniform spacetime.!!95 


Problems 


1. Justify the following: if ry < rg then ds; > dso. 


Answer: The factor (1 - ae My He changes in the opposite sense to r, i.e. (1 - eM isa 


decreasing function of r (see exercise 1 of § 6.4.1). This means that if r; < rg then we should have 


-1/2 -1/2 
(1 _ pt) > (1 - 2g) and hence from Eq. 259 we conclude that ds, > ds2, as required. 


Cer, C2 rg 


Exercises 


1. Discuss how the issue of gravitational length contraction is dealt with in the literature of general 
relativity. 


6.4.4 Comparison with Classical Mechanics and Special Relativity 


In classical mechanics, the space is absolute and flat (in fact it is Euclidean) and hence length is global 
and independent of any local metric property. Therefore, none of the general relativistic implications that 
we investigated in the previous subsections (e.g. gravitational length contraction) do exist because all the 


[197] Ty fact, this seems to be based on comparing the coordinate and metrical variables (e.g. dr and ds; in Eq. 259). But 
this is not necessary because as we stated earlier the comparison can be between the metrical variables (e.g. ds; and 
dsz in Eq. 259) at two locations (one of which could be at infinity, where the coordinate and metrical variables agree, 
although this is not necessary). 

[198] Tn fact, there are many issues related to this discussion some of which have been touched on (from a special relativistic 
perspective) in B4 such as the difference between space and time in having freedom of movement in space but not in 
time. However, we see no urgency in discussing these issues here. We should also note that “uniform” here means in its 
dimensions (i.e. all dimensions have the same contribution and significance to the manifold) as indicated earlier. 
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general relativistic implications are based on the locality and relativity of length and its dependence on 
the local metric properties of the spacetime. 

Regarding special relativity, the spacetime of special relativity is flat coordinated by global inertial 
frames (i.e. one global frame for each observer) where each frame is distinguished from all other frames 
by its relative uniform motion with respect to the other frames (or with respect to absolute frame if we 
believe in absolute frame). As a consequence, we have the following: 

(a) In special relativity the length is global and absolute in each frame unlike the length in general 
relativity which is local (i.e. it varies from one location of spacetime to another) and relative (i.e. it 
depends on the local frame). Moreover, it is independent of the local metric properties of the spacetime 
because these metric properties are uniform and apply equally to all parts of the spacetime. Yes, when 
we compare the length in two special relativistic frames then the length in special relativity is also local 
in this sense (since each frame has its own length) and relative. 

(b) In special relativity length contraction (as observed across two global frames in relative motion with 
respect to each other) is caused by motion and hence it is a function of speed (i.e. it is kinematical) while 
in general relativity length contraction is caused by gravity and hence it is a function of location (or local 
metric properties of spacetime), i.e. it is gravitational. Yes, there should also be non-gravitational length 
contraction in the spacetime of general relativity (in addition to the gravitational length contraction) if 
the frame is moving and not stationary. Supposedly, the determination of this non-gravitational length 
contraction should in principle follow the rules of special relativity (and hence this non-gravitational length 
contraction can be labeled as special relativistic length contraction). 


Exercises 


1. Is there a similarity between length contraction in special relativity and in general relativity? If so, 
what? 

2. Comment on the fact that physical effects like time dilation and length contraction occur in special 
relativity only across global frames (each of which covers the entire spacetime) while in general relativity 
they occur across local frames (each of which covers only part of the spacetime). 


6.5 General Relativity and Absolute Frame 


The issue of absolute frame is central to the theory of special relativity (or what we call Lorentz mechanics) 
but it does not seem to be central to the theory of general relativity. We discuss in the following points 
some potential reasons for this difference: 

1. In the spacetime of special relativity the demand for an absolute frame is natural (as we demonstrated 
in B4) because we have a single global frame that covers the entire spacetime. But in the spacetime of 
general relativity what we have in reality is a collection of local frames pieced together and hence the 
demand for absolute frame seems less obvious and urgent because we are always focusing on a certain 
part of the spacetime. However, this difference should not lead to the conclusion that the issue of 
absolute frame is irrelevant to general relativity. The reason is that the spacetime of general relativity 
is a modified version of the spacetime of special relativity and hence this demand in special relativity 
should be inherited in general relativity. 

2. In special relativity, the existence (or non-existence) of an absolute frame is a fundamental issue because 
special relativity is primarily a theory about space and time and their transformations. This issue may 
seem less important (or even irrelevant) in general relativity which primarily is a theory of gravity 
because gravity is essentially an interaction between objects and this interaction can be described 
satisfactorily by using any local frame that embraces the interacting objects without worrying about a 
global absolute frame.!!99l This should apply even to the classical theory of gravity in which gravity is 
a force and hence the issue of absolute frame is a secondary factor that should not require particular 
attention. However, this may be refuted by the fact that general relativity (especially in its extended 


1199] Interaction may suggest classical meaning since gravity in general relativity is an action of matter on the geometry 
of spacetime with the gravitated object being affected by this geometry. However, this should still be an interaction 
between the gravitating and gravitated objects through the geometry of spacetime. Anyway, this is just a matter of 
language and expression and hence it should not affect the essence of the point. 
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form as a “General Theory” that includes special relativity) is essentially a theory of space and time and 
hence it is like special relativity in this regard.?°°l This should be endorsed by the fact that gravity in 
general relativity (according to the principle of metric gravity; see § 1.8.4) is an effect of the distortion 
of spacetime rather than a force. We should also mention the need of the equivalence principle (which 
is fundamental to the theory of general relativity even as a gravity theory) to a sensible and realistic 
definition of acceleration which requires absolute frame, as will be discussed later. 


Hence, we can conclude that the issue of absolute frame is central even to the theory of general relativity 
and therefore it is important to pose the following question: does general relativity require the existence 
of an absolute frame of reference to make sense of its formalism and provide a logical interpretation? In 
other words, is there an epistemological necessity for the formalism of general relativity to have (or not 
have) a unique absolute frame, and what is the significance of the existence of absolute frame on general 
relativity? So, let inspect this issue in more details. 

According to general relativity, a frame in a state of gravitational free fall (i.e. relative to the source 
of gravity) is locally equivalent to an inertial frame. It should be obvious that the validity of this claim 
is based on the denial of the existence of absolute frame because if we have an absolute frame then the 
entire system (i.e. the source of gravity plus the freely falling frame) could be accelerating relative to 
the absolute frame and this accelerated motion is equivalent to the existence of another source of gravity 
relative to which the frame is not in a state of free fall and hence it is not inertial (even locally). In brief, 
the general relativistic claim that “freely falling frame is locally inertial’ requires the denial of absolute 
frame (noting that free fall is defined relative to the source of gravity). 

This is about the issue of freely falling frame. Regarding the issue of non freely falling frame (regardless 
of being moving or at rest relative to the source of gravity), we just repeat what we said in the case of 
freely falling frame about the possibility of the existence of an absolute frame that contains the entire 
gravitational system. Accordingly, a frame that is not in a state of gravitational free fall (i.e. relative to 
the source of gravity) could be actually in a state of “free fall” and hence it is inertial. This is because it can 
be accelerating relative to the absolute frame in such a way that its acceleration annuls the gravitational 
effect of the source of gravity. So, in both cases (i.e. freely falling frame and non freely falling frame) the 
validity of the equivalence of free fall to inertiality (which is based on the equivalence of acceleration to 
gravity which is the essence of the equivalence principle) requires the denial of absolute frame. 

Now, this denial could be sensible and acceptable as long as we restrict our attention to gravity. However, 
if we look to the wider picture where we are supposed to believe in the validity of special relativity (and 
especially if we incorporate special relativity in the framework of general relativity) then our position 
cannot be maintained unless we extend this denial to special relativity (since we cannot deny it in general 
relativity and accept it in special relativity). As we saw in B4, the sensibility of special relativity decisively 
depends on the existence of absolute frame because even the domain of special relativity (which is inertial 
frames) cannot be defined in a physically realistic and sensible way without the existence of absolute 
frame. In fact, even the formalism of special relativity necessitates the existence of absolute frame if it 
should have a sensible and logical interpretation and avoid obvious paradoxes and contradictions. So, if we 
accept the existence of absolute frame (where this existence in our view is necessitated by special relativity 
which is supposed to be a valid theory) then neither the freely falling frame is necessarily inertial nor the 
non freely falling frame is necessarily non-inertial and this should mean invalidation of the equivalence 
principle which is one of the pillars of general relativity. 

Anyway, the existence of absolute frame is denied by general relativity (or at least by general relativists 
or the majority of them), and this denial is inconsistent with the fact that in special relativity accelerating 
frames were excluded from the theory because the theory is restricted to inertial frames. This exclusion 
means that there is at least an implicit confession in special relativity (which is still valid in general 
relativity) of the existence of accelerating frames in spacetimes in which no gravity does exist. The 
significance of this is that acceleration is an attribute that is determined and defined by the spacetime 
alone regardless of the existence or non-existence of any source of gravity, i.e. the existence of acceleration 


[200] As discussed earlier, general relativity in its extended form has a gravitational component and a special relativistic 
component, and hence even if the gravitational component does not need absolute frame (which we reject) the special 
relativistic component should need absolute frame. 
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is independent of the existence of gravitation. Accordingly, acceleration (which is equivalent to gravitation) 
can exist even in spacetimes in which gravity does exist (where this acceleration is defined relative to a 
frame other than the frame of the gravitating object). As shown in B4 and indicated above, the existence 
of accelerating frames requires the existence of absolute frame and we see nothing in general relativity 
that can remove this requirement. In other words, the mere inclusion of gravity in general relativity and 
the claim of the equivalence between gravity and acceleration does not lead to the abolishment of the 
logic of the absolute frame that we established in our discussion of special relativity. 

The situation may be clarified further by considering the regions of spacetime of general relativity far 
away from matter where the spacetime is essentially the same as the spacetime of special relativity and 
hence it should be possible to have accelerating frames without any reference to gravitating matter (whose 
existence is irrelevant in those regions). In fact, considerations like this may be behind the claim that 
the spacetime of special relativity is a fictitious idealization, but this should lead to questioning even the 
validity of general relativity itself since special relativity is supposed to be at the heart of general relativity 
due to the local application of special relativity in the spacetime of general relativity. In fact, we will 
see later that special relativity (or what we call Lorentz mechanics) is not an idealization at least from a 
practical perspective if not from a theoretical perspective. 

We note that the literature of general relativity is full of examples that indicate the necessity of the 
existence of an absolute frame if we have to logicalize this theory and make sense of it. For example, the 
famous thought experiment of a rocket accelerating in free space indicates the necessity of the existence 
of absolute frame because in the absence of absolute frame the acceleration of the rocket and the observed 
“quasi-gravitational” field cannot be explained.?°!] The necessity of absolute frame even in general rela- 
tivity can also be inferred from the fact that the frame in this thought experiment and its alike is assumed 
to be irrotational which cannot be explained unless we assume the existence of an absolute frame relative 
to which the rotation can have significance and real physical effect. In other words, the absolute nature 
of rotation implies the absolute nature of spacetime and hence the existence of an absolute frame. 

These examples may be challenged by the claim that acceleration and rotation are in reference to another 
frame and hence we do not need a global and unique absolute frame. But this can be refuted by the fact 
that the other frame is not an arbitrary frame. In other words, while the acceleration and rotation relative 
to a certain category of frames have accelerating and rotating effects the acceleration and rotation relative 
to another category of frames have no such effect and hence we should have a physically realistic sense 
of acceleration and rotation by having an absolute frame relative to which the acceleration and rotation 
have real physical effects. 

In brief, the effect of “accelerating /rotating” and “non-accelerating/non-rotating” is a real physical effect 
that can be observed and measured regardless of any particular reference frame except a unique reference 
frame which is the “absolute non-accelerating/non-rotating frame”. The uniqueness of this absolute frame 
should be obvious since only those frames that are non-accelerating/non-rotating relative to this frame 
do not observe accelerating /rotating effects while all the other frames experience such effects where these 
effects are quantified by the magnitude and direction or sense of the acceleration/rotation relative to this 
unique frame. Since such a frame can be determined by the sole entity of spacetime then the existence of 
any particular source of gravity in the spacetime should not affect its accelerating and rotating effects.|?°7| 


Problems 


[201] For example, we read in one of the textbooks (see page 114 of Lambourne et al. in the References): Another thought 

experiment involves a rocket in a region in which there is no gravitational field. If the rocket is accelerated with a 
uniform acceleration of magnitude g, no sufficiently localized experiment within the rocket can distinguish between the 
consequences of the acceleration and the gravitational field on the surface of the Earth. An object released from rest 
within the rocket would accelerate downwards, just as an object on Earth would do. (End of quote) 
So we may ask: why the “object released from rest” should not stay at rest in this accelerating frame (as it should do 
if the frame is inertial) or move uniformly if acceleration is defined relative to an arbitrary frame (noting that no real 
physical effect can depend on acceleration relative to an arbitrary frame)? Similar challenges can be directed to other 
examples and thought experiments like the bending of light in a frame that is accelerating with respect to another frame 
in which the light follows a straight trajectory (see for example page 116 of Lambourne et al.). 

[202] The phrase “sole entity of spacetime” should not imply taking a definite view about Mach principle and that is why we 
added “particular” (see § 6.5.1). 
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1. Explain briefly the significance of the existence of absolute frame on the equivalence principle and 
general relativity. 
Answer: The existence of absolute frame means that in the spacetime of general relativity we have 
accelerated motion (which is equivalent to a gravity that is not annulled by free fall relative to a source 
of gravity) regardless of the existence or non-existence of any matter or energy in the spacetime. 
Accordingly, even a freely falling frame (i.e. relative to the source of gravity) can be an accelerating 
frame (i.e. relative to the absolute frame) and hence it is equivalent to a gravitational frame according 
to the principle of equivalence (see § 1.8.2). Similarly, even a non freely falling frame (e.g. stationary 
frame relative to the source of gravity) can be an inertial frame (if the effect of gravity is annulled by the 
acceleration relative to the absolute frame). Therefore, the existence of absolute frame should impose 
restrictions on the validity of the equivalence principle, i.e. the validity of this principle requires 
the inclusion of the effect of any potential acceleration relative to the absolute frame. This should 
also raise a question about the wisdom (or even correctness) of modeling gravity as a curvature in 
spacetime and if this curvature approach is needed (or even sensible) at all because if the spacetime of 
general relativity is defined by this background absolute frame (which should sensibly and naturally 
coordinate a flat spacetime) then gravity can be a physical phenomenon in a flat spacetime (rather 
than a geometric attribute of a curved spacetime) and hence gravity (like any other force) is a physical 
phenomenon contained in the spacetime rather than being an attribute of the spacetime itself. In 
other words, the classical paradigm of force and gravitational field is sufficient (or even necessary) for 
modeling gravity while the general relativistic paradigm of curvature and metric gravity is redundant 
(or even wrong). If so, then all we need to generalize special relativity (assuming this generalization is 
viable)?! is to look for a theory that generalizes the known “inertial” physical laws and extend them 
from the domain of inertial frames (where special relativity is supposed to be valid) to the domain of 
non-inertial frames and hence we have only two types of frame (i.e. inertial and accelerating). 

2. By analyzing the concept of acceleration in general relativity show that the theory is inconsistent in 
some of its aspects. 
Answer: We note that acceleration in some aspects of the theory is referred to free space (as seen 
for instance in the rationale of the accelerating rocket in free space which is supposed to be equivalent 
to gravitational frame) while in some other aspects it is referred to the source of gravity (as seen for 
instance in the rationale of the free fall state). In other words, while the presence of an absolute frame 
is required in the former to make sense of the inferred physical consequences (otherwise real physical 
effects in the accelerating rocket for instance should depend on its reference to an arbitrary frame; 
see the upcoming note), the absence of an absolute frame is required in the latter to make sense of 
the inferred physical consequences (otherwise a stationary frame for instance can be inertial). This 
inconsistency casts a shadow in particular on the equivalence principle and its logical and physical 
consequences which permeate the entire theory of general relativity. 
Note: let have two frames, A and B, which are accelerating relative to each other. Now, if a massive 
object in space is seen stationary in one frame then it should be accelerating in the other frame. This 
means that an object released from rest within an accelerating rocket (which we can label as frame 
A) will not necessarily accelerate downwards unless the rocket is accelerating relative to a frame (say 
frame B) relative to which the object is not accelerating (say at rest). This means that for the object 
to accelerate downwards, frame B should be a specific frame (i.e. a frame in which the object is at 
rest) and not arbitrary frame. So, frame B should be an absolute frame (or have a certain relation to 
an absolute frame). 


Exercises 


1. Discuss the proposition that general relativity should require (at least in some of its details) an absolute 
frame of reference because acceleration (which is supposed to be equivalent to gravity according to the 
equivalence principle) does not have sensible interpretation without such a frame of reference. 


[203] Tf the generalization is not viable then we need to look for an entirely different theory that represents the physical laws 
in accelerating frames (assuming that all types of accelerating frames have the same laws; otherwise we need to look for 
a theory for each type of accelerating frames). 
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2. Discuss the claim that absolute frame is irrelevant to general relativity because the motion in spacetime 
is explained by the concept of geodesics in spacetime. 

3. What is the significance of bending of light signals in accelerating frames with regard to the invariance 
of the speed of light and the necessity of absolute frame? 

4. Discuss some of the implications of the existence of an absolute frame in the spacetime of general 
relativity. 

5. Discuss the issue of absolute frame as an epistemological or philosophical issue and its relation to 
science from these perspectives. 

6. What is the significance of the conservation of momentum and angular momentum on the existence of 
absolute frame? 

7. Is there physical evidence for the existence of absolute frame? 

Challenge general relativity on the basis of the existence and non-existence of absolute frame. 

9. Follow the logic of the equivalence principle to analyze a gravitational system in which the gravitating 
object is at rest or moving uniformly relative to absolute frame (i.e. the frame of the gravitating object 
is inertial) while the gravitated object falls freely. 

10. What is the significance of the existence of orbiting systems (e.g. solar system) on the absoluteness of 
rotation and the existence of absolute frame? 


oe 


6.5.1 Absolute Frame and Mach Principle 


Apart from its relation to the issue of absolute frame (which in our view is a central issue to general 
relativity), Mach principle has a link to the historical development of general relativity and this is a 
good reason on its own for investigating this principle in this book. According to this principle, which is 
attributed to Mach although it seems to have origin in the writings of other philosophers prior to Mach, 
forces and accelerations experienced by objects in non-uniform motion are determined by the distribution 
of matter in the Universe rather than by the existence of an abstract absolute frame of reference. In fact, 
this principle was originally proposed as an alternative to the hypothesis of the existence of an absolute 
frame (or space) in the Newton’s bucket argument where absolute rest and motion (such as the rotation 
of bucket) can be defined with respect to this absolute frame according to the Newtonian view while 
according to the Mach principle the effect seen in the bucket as a result of rotation is not because of the 
existence of an absolute frame of reference but because of the existence of matter in particular quantity 
and distribution in the Universe that surrounds the bucket (where the existence and distribution of matter 
in the surrounding Universe may be defined primarily by reference to the “distant stars”). 

Apart from the purely philosophical contemplations and futile disputes that are of little interest to us 
as physicists, we do not see a fundamental contradiction between the hypothesis of the existence of an 
absolute frame to which absolute spacetime is referred and the hypothesis of the effect of the distribution 
of matter in the surrounding Universe in defining the state of rest and motion and determining the agents 
of force and acceleration. What we should be interested in as physicists is a practical operational definition 
and determination of a frame of reference that is absolute in a sense to define the state of rest and motion 
sensibly, unambiguously and absolutely. Whether this frame belongs to an abstract absolute spacetime or 
it originates from the existence and distribution of matter in the surrounding Universe is irrelevant from 
a physical perspective. After all, we cannot remove this surrounding matter to have physical experiences 
in a “clean” spacetime that is void of matter. In brief, what is physically important is the existence of a 
unique frame relative to which rest, motion, acceleration and force can be uniquely and sensibly defined 
and determined. Whether this frame is the frame of absolute spacetime (due to its intrinsic properties 
inline with the Newtonian view) or the frame that is established by the particular distribution of matter 
in the Universe (and hence it can be seen as an extrinsic property inline with the Machian view) is 
unimportant and physically irrelevant. 

It should be remarked that in the literature of the relativity theories we can find numerous proposals 
of direct and indirect physical tests and consequences that could in principle distinguish between the 
Newtonian paradigm of absolute space and Mach principle. However, many of these proposed tests 
and consequences are arguable in their validity and significance as they normally depend on particular 
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theoretical frameworks whose validity are not universally approved. In fact, many of these proposals are 
based on personal stereotypes and convictions rather than on rigorous logic and solid science. Anyway, 
this should not affect our assertion about the irrelevance of the above controversy between the Newtonian 
and Machian paradigms because this irrelevance is related specifically to the Newtonian and Machian 
paradigms on the large scale of the Universe and are not concerned with any particular (and normally 
local) effects and consequences of these paradigms. In our view, there is no viable physical test that can 
distinguish between these two paradigms on this large scale and hence the difference from this perspective 
is irrelevant. Moreover, even if such a test does exist it has no practical consequence on the paradigm of 
“absolute frame” in its generic sense that in principle can include both the Newtonian absolute space and 
the Machian world (i.e. a world subject to the Mach principle). 


Exercises 


1. Describe the essence of Mach principle in a few words. 

2. Consider two spheres S$; and S» rotating relative to one another and assume that S$» bulges at its 
equator, how do we explain this difference? According to Einstein “No answer can be admitted as 
epistemologically satisfactory, unless the reason given is an observable fact of experience ... Newtonian 
mechanics does not give a satisfactory answer to this question. It pronounces as follows: The laws of 
mechanics apply to the space R, in respect to which the body Sj is at rest, but not to the space R2 
in respect to which the body Sz is at rest. But the privileged space R, ... is a merely factitious cause, 
and not a thing that can be observed”. Discuss this issue. 

3. Assess (in general terms) the claimed evidence in support and against Mach principle and its link to 
general relativity. 


Chapter 7 
Physics of General Relativity 


The purpose of this chapter is to give a general view about the physics of general relativity and assess 
its nature with some comparisons with the physics in special relativity and occasionally with classical 
physics. The chapter also includes some of the commonly used formulations and equations in general 
relativity about the physics of motion in spacetime. However, before we go through these details it is 
important to make a general remark about the nature of the physics of general relativity and its relation 
to the physics of special relativity which is the precursor to general relativity (noting that this remark 
has mainly theoretical value with no practical impact on the forthcoming investigation). As we know, 
the distinctive feature of the physics of general relativity is the presence of gravity as a metric agent 
that shapes the spacetime and hence influences the physics that takes place in it. So, it is important 
to formulate a general recipe for incorporating the effect of this agent in the physics and hence create a 
link between the physics in the presence of gravity (i.e. general relativity) and the physics in the absence 
of gravity (i.e. special relativity). This recipe is essentially based on the strong equivalence principle 
and is formulated in its most elegant form in the Principle of General Covariance (see § 1.8.2) which 
dictates that the effect of gravity on any physical system involving physical quantities and relations can 
be determined by formulating the laws of the system in its special relativistic tensorial form followed by 
applying a general coordinate transformation with the replacement of 7,,, with g,, and the replacement 
of ordinary derivatives with their equivalent tensor derivatives. Accordingly, we can say that the physics 
of general relativity is a modified version of the physics of special relativity. In other words, the physics of 
general relativity is a curved (or distorted) version of the flat (or straight) version of the physics of special 
relativity. This should be appreciated in the light of the principle of metric gravity (see § 1.8.4) where 
the role of gravity according to this principle is to curve the spacetime and hence “distort” the “straight” 
laws of the flat spacetime of special relativity. 


7.1 Coordinates of Spacetime 


Coordinates of spacetime in general relativity (assuming mostly a Schwarzschild metric in Schwarzschild 
coordinates) have been investigated thoroughly earlier (see § 6) and hence we do not repeat. The main fact 
that should be kept in mind from our previous investigation is that in the spacetime of general relativity 
we have coordinate variables (like coordinate time t) and physical variables (like proper time 7) where 
the two are linked through metrical relations, i.e. relations based on the Schwarzschild metric in our 
case. This fact should be considered in all other physical quantities and relations that depend on the 
coordinates of spacetime. Accordingly, we note the following: 

e The physical significance of spacetime coordinates in general relativity is different from that in special 
relativity and in classical physics since in special relativity and classical physics the coordinates have 
direct physical significance and metrical value (due to the flat nature of the underlying spacetime), while 
in general relativity (where the spacetime is curved) the coordinates are mere labels and markers to 
identify points and events in spacetime and hence we need metrical relations to convert these coordinates 
to physically significant quantities. 

e The physical observations in the spacetime are presumably independent of the employed coordinate 
systems. Accordingly, the choice of coordinate systems is usually a matter of preference and convenience, 
and hence in principle any coordinate system is legitimate to use as long as it fulfills its basic function. 
However, in most cases there are advantages and disadvantages in using certain coordinate systems in 
certain cases, e.g. using systems that demonstrate symmetry or do not lead to singularity (see exercise 
11 of § 2.1). In fact, the issue of the choice of coordinate systems in general relativity is more sensitive 
and grave and can be more than a matter of having advantages and disadvantages. The reason is that 
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there are many examples in the physics of general relativity where certain coordinate systems can lead 
to misleading results and suggest wrong conclusions. Accordingly, in general relativity the choice of 
coordinate systems and the interpretation of their results require extra care and attention, unlike special 
relativity and classical mechanics where the situation is usually clear and straightforward. 


7.2 Time Interval and Length 


Time interval and length are physical quantities that basically represent relations between coordinates in 
spacetime (primarily in flat spacetime and through metrical relations in curved spacetime). Therefore, time 
interval and length in the spacetime of general relativity should be extracted from the metrical relations 
that are based on the coordinates of spacetime, as explained in § 7.1. In fact, we have investigated the 
core of these issues in § 6.3 and § 6.4 and hence we do not repeat. However, we would like to highlight 
the following important points: 

e Considering the presumed validity of special relativity in the spacetime of general relativity (regardless 
of the details), time interval and length (as well as the physical quantities that depend on them) can 
be affected non-gravitationally (i.e. kinematically) as well as gravitationally and hence in some cases 
we should consider the combined result of these special and general relativistic effects. In fact, in some 
disciplines, like relativistic cosmology, we may even need to consider effects that are cosmological (i.e. 
neither gravitational nor kinematical) as well. 

e Time interval and length are coordinate dependent and hence they are functions of the 4-position in 
spacetime. Accordingly, they are relative and have no absolute sense on a global scale!?°4! unlike their 
status in classical mechanics (and even in special relativity where they are global and absolute in each 
inertial frame). The significance of this is that physical quantities that depend on them (such as energy) 
are also relative and have no absolute significance on a global scale, and this has a direct impact on 
essential physical principles, like the conservation of energy, which lose their significance on a global level 
in the spacetime of general relativity (see for example § 7.9). 


Problems 


1. In § 6 we accepted calibration of basic physical quantities (mainly time and length) using local standards 
(mainly at infinity) while here we claim that there is no global or absolute physical significance of 
physical quantities that depend on coordinates due to the dependence of these quantities on the local 
metrical properties. How to reconcile these seemingly contradictory views? 

Answer: Our view is that although the calibration by local standards at infinity gives a legitimate 
basis for comparison, it is not sufficient to give a universal physical significance to coordinate-based 
physical quantities in curved spacetimes, and hence the lack of global physical significance cannot be 
avoided in curved spacetimes despite our ability to make sensible comparisons by the calibration at 
infinity. In brief, being able to make comparisons thanks to calibration by local standards at infinity 
does not imply global physical significance. Moreover, the calibration at infinity may apply to certain 
metric spaces (e.g. Schwarzschild) but not to all metric spaces and hence in the metric spaces that 
have no sensible calibration at infinity there should be no global physical significance.!?°! In fact, 
even if this applies to some metric spaces it should be enough for denying global physical significance 
in general due to the violation of the general validity of global physical significance. 

Anyway, this (i.e. seeming contradiction) is an issue for general relativity to address and tackle and 
hence it is not our problem although in the forthcoming parts of the book we will keep advocating and 


[204] Although they have no absolute sense on a global scale they can still be calibrated by using local standards (e.g. at 
infinity) to make useful comparisons, as explained earlier. We should also note that concepts and quantities related to 
time and length (such as simultaneity) should also lose their global significance and unique meaning and hence they 
become somewhat ambiguous (although we can still make useful ordering relations based on using local standards and 
references). 

[205] Th fact, this is based on considering (implicitly) that the physical quantities at infinity have a sort of global significance 
(due to the flat nature of spacetime there as in special relativity) unlike other locations in spacetime. However, we 
should note that “at infinity” is also local and hence the calibration at infinity is like the calibration at any other locality 
(which obviously has no global significance). 


7.3 Frequency 149 


building on our view by considering this as an issue of lack of global physical significance even though 
useful comparisons (by calibrating by local standards) can still be made. 


Exercises 


1. Why coordinate dependent quantities lose their global significance despite the ability to calibrate by 
local standards (e.g. at infinity)? 

2. Discuss, briefly, the physical effects that influence time interval and length in general relativity and 
compare them to those in special relativity. 


7.3 Frequency 


Frequency in general relativity has its normal and common meaning although it is subject to gravitational 
effects. In § 6.3.4 (and later in § 8.4) we investigated (rather thoroughly) frequency and how it is affected 
by gravity in the Schwarzschild spacetime and hence we do not repeat. However, it is important to note 
the following points about the frequency in general relativity: 

e The gravitational effects that influence the frequency in the spacetime of general relativity should 
also influence the energy through the Planck’s relation and this should affect the principle of energy 
conservation (at least as applied to electromagnetic frequency and energy). 

e Frequency may also be affected kinematically (as a consequence of the application of special relativity) 
and even cosmologically. 

e Frequency has no global physical significance thanks to its dependency on physical time (which in general 
relativity is local and metric dependent). 


Exercises 


1. Compare frequency in general relativity with frequency in special relativity and in classical physics. 


7.4 Mass 


Mass in general relativity is a common and unique concept that can be described loosely as an invariant 
scalar physical quantity that contributes (among other physical quantities) to the formation of the geome- 
try (and hence the metric) of spacetime. In fact, mass in general relativity inherits all its basic properties 
(e.g. its equivalence to energy according to the relation Ey = mc?) from special relativity (and to some 
extent even from classical physics) with the addition of its new qualification as an agent of gravity. How- 
ever, as discussed in several places in the book the classical concept of gravitated mass (as such) is made 
redundant by the paradigms of geodesic and free fall gravitation where (free) gravitated objects follow 
geodesic pathways in the spacetime regardless of their “gravitated” mass (and indeed regardless of even 
being massive or massless). In fact, even the concept of gravitating mass (as such) could be redundant 
since the redundancy of the gravitated mass could lead to the redundancy of gravitating mass due to 
their binary relation. Accordingly, the fundamental role of mass (and hence its qualification) in general 
relativity as a gravity theory is essentially to create the geometry of spacetime and determine its metric 
properties and structure. 

It should be remarked that the apparently global and absolute nature of mass in general relativity may 
be inconsistent with the local nature of energy (see § 7.8) as can be seen vividly in the mass-energy 
equivalence relation (i.e. Eg = mc? where Ep is supposedly local while m is supposedly global)?°°! unless 
we assume that certain types of energy (e.g. energy of rest mass) is global while other types of energy 
(e.g. energy of electromagnetic radiation) is local. However, this seems nonsensical and accordingly this 
issue could indicate an inconsistency in the theoretical structure of general relativity. In this context, 
we should note that the Poincare mass-energy relation of special relativity still holds in general relativity 
although the contribution of some types of energy to mass may not hold. For example, position-dependent 
gravitational potential energy does not contribute to mass although this may not be undisputed issue (in 
fact such type of energy does not exist in a strict sense within the theoretical framework of general 


[206] T¢, may be claimed that the local nature of Eg and the global nature of m are not inconsistent if we note that c in general 
relativity is local. 


7.5 Velocity, Speed and Acceleration 150 


relativity). This could mean that the mass-energy relation must hold in general relativity exactly as in 
special relativity with no additional contribution from gravity (which is what distinguishes the spacetime 
of general relativity from the spacetime of special relativity) although other factors should be taken into 
consideration.|2°71 In fact, a close inspection to these issues and their alike could lead to serious challenges 
to general relativity and the consistency of its theoretical framework. 


Exercises 


1. Why mass in general relativity should be global while quantities like length and time interval should 
be local? 

2. Compare mass in general relativity with mass in special relativity. 

3. Discuss the issue of the nature of mass in general relativity (as being local or global) from the per- 
spective of proper and improper mass (assuming the validity of special relativity in the spacetime of 
general relativity). 


7.5 Velocity, Speed and Acceleration 


Although the basic definitions of the concepts of velocity and speed in general relativity are essentially the 
same as in classical mechanics and Lorentz mechanics, these concepts should have no global significance 
thanks to their dependence on length, time interval and direction which are not well defined globally in 
general relativity. The obvious example of this is the speed of light which (according to general relativity) 
is constant locally but not globally (due to the lack of global significance of distance and time). So, in 
essence the constancy of the speed of light is not valid in general relativity although this is justified by 
the distinction between local and global speed (see § 10.1.13).1?°8! Like velocity and speed, acceleration 
in general relativity is not well defined globally due to its dependence on distance, time and direction. We 
should also refer the reader to § 7.13 about geodesic deviation which is related to acceleration. 


Exercises 


1. It may be claimed that mathematical constructions and techniques like parallel transport can give 
global significance to physical quantities that depend on coordinates and orientations. Discuss this 
claim briefly. 


7.6 Force 


General relativity does not include the paradigm of gravitational force in its theoretical framework and 
formalism since it is replaced by the paradigms of geodesic motion and free fall gravitation which are 
based on the principle of metric gravity. Yes, there is a place for the paradigm of non-gravitational force 
in its theoretical framework and formalism (since non-gravitational forces, like electromagnetic, do exist 
in general relativity). However, force (like acceleration) should have no global significance in general 
relativity due to its dependency on local quantities and entities like time and direction (unlike its status 
in special relativity). 


7.7 Momentum 


Both linear and angular momentum should have no global physical significance in general relativity because 
direction and orientation are essential in the definition of linear and angular momentum, whereas in the 
curved spacetime of general relativity direction and orientation are not well defined globally (despite the 
availability of certain mathematical techniques such as parallel transport to make useful comparisons). 
Momentum also depends on other locally defined entities (such as speed) which degrade its global status 
and significance. 


[2071 The absence of additional contribution from gravity should be inline with the fact that in general relativity gravity 
(which what distinguishes general relativity from special relativity) is a spacetime attribute and not a force. 

[208] To be more consistent we may need to say: the constancy (and non-constancy) of the speed of light in general relativity 
is meaningless at global level (due to the absence of global physical standard of calibration). 
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7.8 Energy and Work 


Due to their dependency on time and length, energy and work have no global significance in general 
relativity and hence they are well defined only locally. We should also note that in general relativity the 
paradigm of gravitational potential energy does not exist in a strict sense. 


Exercises 


1. Give an example of the relativity of work in the spacetime of general relativity (e.g. Schwarzschild 
spacetime). 


7.9 Conservation Principles 


As pointed out earlier, physical quantities and entities like time, length and direction have no absolute and 
global significance in the framework of general relativity and hence any physical quantity that depends on 
these quantities and entities (such as energy and momentum) should also lose their absolute and global 
significance. Accordingly, all the conservation principles that depend on these quantities and entities 
should become invalid (or rather meaningless) at global level.°9! This is inline with the fact that the 
conservation principles of energy and momentum are consequences of the homogeneity of time and space 
(respectively) and such homogeneity does not exist in curved spacetime. This should similarly apply to 
the conservation of angular momentum which is a consequence of the isotropy of space. 

We should also note that the conservation principles (of energy and momentum in particular) are closely 
related (at least within a classical context) to the Newton’s laws of motion and hence any (special or 
general) relativistic impact on one of these should affect the other. Also, the general relativistic paradigm 
of geodesic motion of free objects (with the incorporation of gravity in the structure of spacetime instead of 
being a force) strips gravity from its classical dynamical significance and this may facilitate the connection 
between the law of gravity and the laws of motion (as represented classically by Newton’s gravity law and 
Newton’s motion laws). Some of these issues will be touched on mildly in the questions. 


Exercises 


1. Discuss the principles of conservation of linear and angular momentum within the framework of general 
relativity. 

2. According to the relativity theories the 4-velocity (and hence the 4-momentum which is the 4-velocity 
times the supposedly-constant mass) of a free massive test object is a constant tangent vector to its 
world line which is a geodesic. Discuss this issue and speculate about its significance. 

3. Give a technical meaning to the conservation of energy and momentum of free massive objects in the 
spacetime of general relativity. 


7.10 Orbital Motion in Terms of Constants of Motion 


Motion in general is commonly described and formulated classically in terms of the constants of the 
motion, namely energy and momentum. A similar approach is followed in general relativity where motion 
is described and formulated in terms of similar “constants of the motion”. The main advantage of this 
method is its simplicity relative to other more complicated methods like solving the geodesic equations. 
Due to the exceptional importance of orbital motion this method is commonly followed in the general 
relativistic formulation of orbital motion and hence it is common in the literature of general relativity to 
develop and derive formulae for orbital motion in terms of energy and momentum. 

In the following we derive the “energy” equation for the variation of the radial coordinate r assuming 
Schwarzschild metric. This equation (which may be seen to represent the conservation of energy and 
angular momentum of freely falling massive objects in gravitational fields although this conservation is 
not strictly valid globally) describes the orbital motion of massive test object in the gravitational field of 


[209] As we will see later (refer for example to § 10.1.3), we can identify two main types of violation of energy conservation: 
one is related to global insignificance of energy and the other is related to creation and annihilation. 
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a Schwarzschild gravitating object outside Schwarzschild radius (i.e. r > 2M ). For timelike trajectory 
of a massive object we have (do) = c? (dr)* £ 0 and hence: 
(do)? = & (dr)? (260) 
F (edt)? — F~ (dr)* — r? (d0)? — r? sin? 9 (dd)? =e? (dr)? (261) 
dt \” dr \” do\” do\” 
F 2 F-1 2 2: as 2¢@ ie, = 2 262 
x (=) (F) : (=) eee (3) : ioe?) 


where in line 2 we use Schwarzschild metric (with F = 1— 2G") while in line 3 we divide by (dr)? £0. 


Now, since the massive object is free then the trajectory is a geodesic described by the system of Eqs. 
161-163. So, if we combine the system of Eqs. 161-163 with Eq. 262 (noting that in the system of Eqs. 
161-163 we have 0 = 1/2, d0/dr = 0 and u = 1/r) then on substituting from Eqs. 161 and 163 into Eq. 
262 we get: 


F712 4? ay . = ¢ (263) 

ex (*) Fe =-:¢F (264) 

2A? e) = (1 | = 2(1- 3%) (265) 

@) te aa ee 200 
2GM 


where in line 2 we multiply with F’, in line 3 we substitute (1 - ) for F’, and in line 4 we manipulate 
the equation algebraically. 

Now, physically B is the magnitude of the angular momentum per unit mass of the falling object (see § 
1.5.1), that is B = J/m. Regarding the physical significance of A we can use the Lorentzian formulation 
where we have dt/dr = y to have a clue. Accordingly, from Eq. 161 (noting that y is the energy scale 
factor as can be seen from the equation of total energy E = mc?y) we see that A is proportional to the 


energy of the object, that is: 


cr 


E=aA (267) 


where a is the proportionality factor. Now, for an object at rest at infinity (ie. dr/dr = 0 and r — oo) 
we have from Eq. 266: 

0 =c* (A*-1) (268) 
and hence A? = 1 (resulting in A = 1 since A is positive as can be deduced from Eq. 161 noting that 
dt/dr > 0 and F > 0 for r > 2a), So, from Eq. 267 (applied to such object, i.e. at rest at infinity) we 
get: 


Ey = mc? =a (269) 


where m is the invariant mass of the object and Ep is its rest energy. On substituting from Eq. 269 into 


Eq. 267 we get: 
E 
E=mc’A > A=—, (270) 
me 
where F£ is the total energy of the object. The last equation means that physically A represents the ratio 
of the total energy of the object to its rest energy. On substituting from B = J/m and A = E/ (mc?) 


into Eq. 266 we obtain: 
dr\? 2GM\ 2GM E? 
1 See 1 271 
(+) mr? ( er ) r 7 = ) am) 


Problems 
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1. Derive a similar equation to Eq. 266 for massless objects. 
Answer: We follow a similar method to the method of derivation of Eq. 266. Massless objects follow 
lightlike geodesics (or null geodesics) and hence their quadratic form (do)’ is zero, that is: 


(do)? = F (edt)” — F-! (dr)? — r? (d0)* — r? sin? 6 (dd)? = 0 


With no loss of generality we can consider the motion to be in the equatorial plane 0 = 7/2 and hence 
the last equation becomes: 


F (cdt)” — F~! (dr)” —r? dd)” = 0 
dt \” dr \? do\? 
2 -1 2 = 
2r(5) - (a) -7 (&) = 
where in line 1 we use dO = 0 and sin@ = 1 while in line 2 we divide by (dA). Now, if we substitute 
from Eqs. 164 and 166 into the last equation we get: 


A dr B? 
: ps 0 
ae dd r? 
dr\* — B? 
2 A? Be) ee 20) 
. dd 7 
dr\?— B? 2GM 
i = ew 272 
(=) =f r2 ( Cr ) er 
where in line 2 we use u = 1/r, in line 3 we multiply with F’,, and in line 4 we use F = 1 — 2GM Eq. 


272 for massless objects is the equivalent of Eq. 266 for massive objects. 
Exercises 


1. Obtain Eq. 162 from Eq. 266.!?!° 

2. Obtain Eq. 165 from Eq. 272. 

3. Derive an equation for the radial motion (i.e. motion with constant @ and constant ¢) of free massive 
object using Eq. 266. 


7.11 Effective Potential in Orbital Motion 


Following the footsteps of classical mechanics, it is customary to develop a formulation for the “effective 
gravitational potential” of a massive gravitated object in the gravitational field of a gravitating object 
according to the framework of general relativity although the concept of gravitational potential does not 
exist in general relativity in a strict sense. To do so, we start from the “energy” equation of orbital motion, 
i.e. Eq. 271. We should remark first that although the term on the right hand side of Eq. 271 does not 
represent energy it represents a quantity that is indicative of the energy of the orbiting object and hence 
Eq. 271 can be regarded as “energy” equation for the orbital motion. Now, if we divide Eq. 271 by 2 we 


obtain: 
1fdr\* 2GM\ GM ce ( EP 
1 = 1 273 
2 (F) 2m?r2 ( er ) r 2 (5 ) ey) 
1 2 2 M M 2 2 E? 
dr J G GM J _¢ 1 (274) 
2\dr 2m2r2 r c2m?2r3 2 \m?2c4 


[210] Ty fact, this should serve as verification for the derived energy equation. 
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To have a better insight into the significance of Eq. 274, let compare it with its Newtonian counterpart. 
In Newtonian mechanics the energy equation for orbital motion is given by: 


1 (dr \? 
5($) i eo (275) 
1 (dr\? J2 GM 
= p. 2 
2 (=) QWm2r2 r (276) 


where Vy is the Newtonian effective potential (which is characterized by being a function of r only for a 
fixed J) while EF, is the total energy of the orbiting object per unit mass (i.e. E//m). So, if we tolerate 


the difference between E,,, in Eq. 276 and = ( Be 1) in Eq. 274 (as well as the difference between t 


m2ct 
and 7 and the difference between the radial distance r and the radial coordinate r) then Eqs. 274 and 276 


are identical apart from the term Sous (which is also a function of r only for a fixed J). Accordingly, 
if we extend the paradigm of “effective potential” (which is a classical paradigm) to general relativity then 


the general relativistic effective potential V¢z in orbital motion is given by: 


nie GM GMJ? 


V, = 277 

ae 2m2r2 r c2m?2r3 an) 
GM J? 

Ver = Vn- aaa (278) 


Exercises 


1. Discuss briefly gravitational potential energy in general relativity. 
2. Use the equivalence principle to argue that gravitational potential energy has no real meaning in 
general relativity. 


7.12 Radial Trajectories in Spacetime 


In this section we briefly investigate the general relativistic formulations for the radial trajectory of 
massless and massive free objects where we are assuming (as usual) Schwarzschild geometry.!?"!!_ These 
formulations are usually given as implicit correlations between time (¢ or T) and radial coordinate r, i.e. 
t(r) or T(r). For massless object the Schwarzschild quadratic form becomes (noting that for massless 
object do = 0 and for radial trajectory dé = dé = 0): 


(1 = *s) (cdt)” — (1 = *s) - (dr)? = 0 (279) 


e (F) : (1 - *s) es (280) 
of = + (1 - oy (281) 


where the plus sign is for motion away from the gravitating object (since r increases as t increases) while 
the minus sign is for motion toward the gravitating object (since r decreases as t increases). On integrating 
the last relation we obtain: 


I 


ct =+ [r+ Rg ln(r — Rsg)| (282) 


where the (arbitrary) constant of integration is set to zero and r > Rg. The last equation implicitly 
correlates the radial coordinate r to the temporal coordinate t. It is noteworthy that Eq. 282 indicates 
that infinite coordinate time is required for a massless object to fall to the event horizon of a black hole 
(noting that coordinate time represents the proper time of an observer at infinity). 


[211] Tm fact, we also restrict our attention in the following formulations to the region r > Rg (to avoid the mess of black 
holes at and inside the Schwarzschild radius). 
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Xd _D 
X] 


Figure 3: A simple sketch representing two points (or events) in spacetime (x; and x2) on two adjacent 
geodesics (C; and C2) where these points are connected by a displacement 4-vector D (i.e. D4 = xf —2''). 
The geodesics C and C2 are parameterized by a single natural (or affine) parameter \ and the points x1 
and X2 correspond to the same value of A. 


By a similar argument, it can be shown (see Exercises) that the radial trajectory of massive object 
falling freely from rest in the Schwarzschild spacetime is given (implicitly) by: 


PM +3Rsvr | Rs) (vr-VRs\| | 
some oe (Ave) +e os 


where C' is a constant (which can be set to zero). 


Exercises 


1. Show that Eq. 282 is obtained by integrating Eq. 281. 

2. Find an analytical expression for the proper time (as a function of r) for a massive object dropped 

from rest at an initial radial coordinate r; far away from the source of gravity (and hence it is freely 

falling radially toward the gravitating object). 

What is the 4-velocity of the object in the previous exercise? 

4, Find an analytical expression for the coordinate time (as a function of ) for a massive object dropped 
from rest at an initial radial coordinate r; far away from the source of gravity (and hence it is freely 
falling radially toward the gravitating object). 


oo 


7.13 Geodesic Deviation 


In general relativity, geodesic deviation essentially describes and quantifies the relative acceleration be- 
tween two freely falling particles along two adjacent geodesics. In a sense, it is a replacement of the 
classical paradigms of field gradient and tidal force and acceleration. Let have two neighboring geodesics, 
C; and C2, and they are parameterized by a single natural (or affine) parameter » (see Figure 3). Let 
D(A) be a displacement 4-vector that connects two corresponding points (i.e. having the same \ value) 
at(X) on C; and xf (X) on C2.?"41 The geodesic deviation for these points as \ varies is then given by 
the condition (which quantifies the change in the displacement 4-vector as a function of A): 


6° D® 4 Re aod 


pie ee a ae Ge) 


[212] To make an analogy and avoid confusion we call D" displacement vector although it may be more appropriate to call 
it deviation or separation vector. 
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where Fae symbolizes the second order absolute derivative, Regn 18 the Riemann-Christoffel curvature 
tensor of the spacetime, and x°, x7 are the spacetime coordinates of the points on the geodesics and where 
all these quantities correspond to a given value of X. It is obvious that the equation of geodesic deviation 
solely depends on the metric tensor of spacetime and hence it is intrinsic. We note that the equation 
may be written in a number of different forms (e.g. ope = 2 2 ae wel) using the anti-symmetry 
properties of the Riemann-Christoffel curvature tensor (and this is the main reason for appearing in the 
literature in confusingly different forms). We should also note that the equation is a tensorial relation 
and hence the “tidal gravitational field” is a tensor (even though the gravitational field itself is not since 
it can be transformed away and vanish by free fall according to the equivalence principle).?13 


Exercises 
1. Analyze the equation of geodesic deviation (i.e. Eq. 284) in flat spacetime. 
2. Derive the equation of geodesic deviation (i.e. Eq. 284). 


3. The general relativistic paradigm of geodesic deviation is commonly considered in the literature to be 
a substitute for the classical paradigms of gravitational gradient and tidal force. Comment on this. 


[213] In fact, some of these issues can be questioned. 


Chapter 8 
Consequences and Predictions 


In this chapter we investigate the theoretical consequences and predictions of the theory of general rel- 
ativity. We will also investigate in some cases how these consequences and predictions are derived from 
the theory and how they compare to the corresponding theoretical predictions of classical physics. The 
chapter will also include solved problems and exercises about the applications of these theoretical results. 


8.1 Perihelion Precession of Mercury 


The precession (or advance) of the perihelion of Mercury is an orbital phenomenon caused by the slow 
change in the orientation of the long axis of the Mercury orbit. Accordingly, the perihelion does not 
have a fixed position in space relative to the Sun but it gradually shifts by rotating around the Sun and 
hence the orbit does not close (see Figure 4). The perihelion position of Mercury advances by about 5600 
arcseconds per century; most of which arise from precession of equinoxes and from perturbations caused by 
gravitational interaction with other planets in the solar system. Classical mechanics can explain almost 
all this 5600 arcseconds advance except 43 arcseconds per century.!?!41 Calculations based on general 
relativity can accurately predict this extra precession and hence almost all the remaining 43 arcseconds 
can be accounted for by general relativity. 

Although the extra precession of perihelion in planetary motion is commonly regarded as a purely 
general relativistic effect, there are Newtonian formulations of this precession although some of these 
formulations may not agree quantitatively with the prediction of general relativity. In this context, it is 
worth noting that the general relativistic formula that correctly predicts the extra precession of Mercury 
perihelion has been derived earlier in the late 19°” century by a German school master called Gerber using 
a classical approach (known as retarded gravitational potential) and hence general relativity is not the 
first or the only theory that could explain this extra 43 arcseconds advance. The reader is referred to the 
paper of Ian McCausland “Anomalies in the History of Relativity” for more interesting details about this 
issue and the debate surrounding it. It should be noted that this formula has also been obtained later 
from non-classical gravitational theories other than general relativity (e.g. the theories of Whitehead and 
Birkhoff). 

We should also note that we title this section as “Perihelion Precession of Mercury” specifically because 
of its historical and observational significance. Otherwise, it is just an instance of the periastron|?!®] 
precession that is predicted by general relativity to occur in any orbital motion whether of Mercury or 
of another planet or indeed of any orbiting object. Accordingly, the following investigation and analysis 
should have general applicability (within the stated conditions) although it is largely phrased and presented 
in the context of Mercury and planetary motion. 

Using the geodesic equations in the Schwarzschild metric (ie. Eqs. 193-195) it can be shown (see 
Problems) that the extra precession per revolution in the orbit of a gravitated object in the gravitational 
field of a static and spherically symmetric gravitating object is given by: 


2 BACT? 
~ Inc2a4 (1 — e?) 


dg (285) 
where 6¢ is the extra precession per revolution in radians, G is the gravitational constant, M is the mass 


of the gravitating object (i.e. the Sun in the case of planetary motion), T is the orbital period of the 


[214] We are considering the commonly recognized classical theory and hence we do not include marginal theories that claim 
to explain this extra 43 arcseconds classically. 

[215] Periastron is similar to perihelion but perihelion is specific to the Sun (i.e. Helios in Greek and hence helion) while 
periastron is for stars in general (and hence astron). 
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Figure 4: A simple sketch demonstrating the precession of perihelion PP of planetary orbit around the 
Sun (where P; and P2 are two consecutive perihelia). 


gravitated object, c is the characteristic speed of light, a is the mean distance between the gravitated 
object and the gravitating object,|?!° and e is the eccentricity of the orbit. 

The above formula gives the extra precession per revolution. So, to calculate the extra precession per 
century we multiply the extra precession per revolution by the number of revolutions per century where 
this number can be easily obtained from dividing the time of a century (expressed in a suitable time unit 
like second) by the time of orbital period T (expressed in the same time unit). Accordingly, the extra 
precession per century is given by: 


_ 3M? 8M? _ 8G? MTT, 
— Inctat (1 — e?) ©" Inctat (1—e?) ~ T  2nc%at (1 — e?) 


bbe (286) 
where 6¢, is the extra precession per century in radians, N, is the number of revolutions per century and 
T, is the time of a century. For example, if T and TJ, are in days then T, = 36525 days. 
2 2 
As we see, Eq. 285 is made of a product of a common factor (i.e. 234") and a planetary factor 


Qe 
that distinguishes each planet from the others (i.e. at): Similarly, Eq. 286 is made of a product 


of a common factor (i.e. a zi and a planetary factor (i.e. ates) where this planetary factor is a 


product of the previous planetary factor (i.e. oie) and the number of revolutions per century (i.e. 
N.). Accordingly, the relative size of the extra precession per revolution is determined by the size of the 
planetary factor (i.e. at) while the relative size of the extra precession per century is determined by 
the size of the aforementioned planetary factor and the number of revolutions per century. 

In Table 1 we present the orbital parameters (specifically T, a and e) of the planets in the solar system 
and calculate the extra precession per revolution 0¢ and the extra precession per century d¢, (using Eqs. 
285 and 286). As we see, both these extra precessions are decreasing according to the order of the planets 
in their proximity to the Sun. 


Problems 


1. Derive the general relativistic formula for the extra precession of planetary perihelion in the classical 
limit, and show that the predicted precession by this formula is almost identical to the observed pre- 
cession of perihelion of Mercury. 

Answer: The precession of perihelion is a planetary motion problem and hence it should be investi- 
gated as a geodesic problem in the Schwarzschild metric. So, it is closely related to our investigation 
in § 4.1.2 and hence we can use the results that we developed in § 4.1.2. More specifically, we can 
start from Eqs. 193-195 in problem 3 of § 4.1.2. So, if we use the Schwarzschild quadratic form that 


[216] The mean distance is represented by the semi-major axis of the ellipse that approximates the shape of the orbit. 
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Table 1: The planets in the solar system with their orbital parameters (T,a,e) and extra precessions 
(6, 6¢-) according to Eqs. 285 and 286 where T is the orbital period (in days), a is the mean distance 
(in 10° m), e is the eccentricity of orbit, 5@ is the extra precession per revolution (in arcseconds) and 6¢. 
is the extra precession per century (in arcseconds). For the purpose of comparison, we also include the 
number of revolutions per century N,.. We note that the numbers in this table are not highly accurate. 


L a e od Ode Ne 
Mercury 88 57.9 | 0.206 | 0.1035 | 42.9515 | 415.06 
Venus 225 108.2 | 0.007 | 0.0530 | 8.61620 | 162.55 
Earth 365 149.6 | 0.017 | 0.0383 | 3.83293 | 100.01 
Mars 687 227.9 | 0.094 | 0.0254 | 1.35031 | 53.17 


Jupiter 4331 | 778.6 | 0.049 | 0.0074 | 0.06208 | 8.43 
Saturn 10747 | 1433.5 | 0.057 | 0.0039 | 0.01342 | 3.40 
Uranus 30589 | 2872.5 | 0.046 | 0.0020 | 0.00237 | 1.19 
Neptune 59800 | 4495.1 | 0.011 | 0.0013 | 0.00077 | 0.61 
Pluto 90560 | 5906.4 | 0.244 | 0.0010 | 0.00042 | 0.40 


we developed in that problem (i.e. Eq. 196 to be specific) then we have: 
ae. Fo! (dr\? dg) _ 
dt ce \dr c? \ dr 
PF A\? F-! (dr\? r2 (BY? ae 
F ce \dr c2 \ r 


Aes eet ei ne 
F c2 \dr Cr 
At es (2 ay BY a2 
F ce? \ dé dr Cr 
A? Fo} (2) BP ok 
F c2 \ddr? Cr? 
Ae Fly? ) Be sh 
F crt \dd Cr 
ce? A2r4 ( dr ) Pr c?F rt 
B? dd B? 
2 2 pd 2 424 
(=) = -< = = ~~ Fr? (287) 
where in line 2 we use Eqs. 193 and 195 (noting that u = 1/r), in line 3 we simplify, in line 4 we use 


the chain rule, in line 5 we use Eq. 195, and in line 7 we multiply by eae Now, since u = 1/r (and 


hence r = 1/u) then we have: 


dr 1 du 


and thus Eq. 287 can be written as: 
du 2 _ Fr ie Art a 
u2 do B? B? 


ud 


1 (du? 7 CF oA F 
dé BeuA Beut ou? 
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2 
a = eF oA — Fu? 
do B? B? 
du\? (ex 2GMu\ _CA* 4 ; 2GMu 
dd > B? C2 Ppa se c 
du\* @ 2GM. @A 4. 9GM s 
=— = t uc U 
dé Be Be RB * @ 
du\* IGM» - 5 96M eee 
dd = 2 i ae a + BR? 
2 2 2 4 A A2 
2GM A 
cy cae ey ane ee (289) 
do Ce 2GM B? 2GMB2 2GMB? 
where in line 2 we use r = 1/u and in line 4 we use F = 1 — sl “ while the other lines are based 


on simple algebraic manipulation. As we see, the expression inside the parentheses on the right hand 
side of Eq. 289 is a cubic expression in u and hence according to the rules of polynomials it can be 
factorized into three factors where each factor corresponds to one root of the cubic.!?!71 Now, the 
roots of the above cubic should include ug and u; that correspond to the perihelion distance rg and 
the aphelion distance r; respectively. The reason is that at perihelion and aphelion rp and rj, are 
minimum and maximum respectively and hence or = 0 which (according to Eq. 288) makes oe =0 


and this should cause the cubic in Eq. 289 to vanish (since 2“ 4 0) which means that uo and uy 


are roots of this cubic. Now, according to the rules of polynomials that govern the relation between 


the coefficients and the roots the sum of the roots of the cubic in Eq. 289 is = Pt Therefore, the 


third root should be 72— — ug — u,. Hence, Eq. 289 can be written as: 


IGM 
du\? 2GM e 
(=) =i tre (u — uo) (u— uw) (u- sey tte) 
du\? 2GM 2GM_ 2GM 
(=) = (u—Uo)(u ua) ( 2 u—14 2 Uo + 2 u) 
2 
(=) = (u—upo) (u ua) ( 147m lu + uo 4 ul) 
2 
(Se) = w= uum) (1-7 fu tuo + unl) 
1/2 
. = Vo =) (w= tH) (1~ 7 fu + uo + ul) 
dé) —  (1— 74M [ut uo + wi)? 
dul / (uo — u) (u = U1) 
dd 1+ SY [w+ uo + ui] 
See) ee c 290 
du / (uo — u) (u = ur) a 


where in line 5 we have (uo — u) > 0 and (u— uy) > 0 because up is maximum (since rp = 1/uo is 
minimum) and wu; is minimum (since r; = 1/u; is maximum) and hence (1 — 2S“ [u+ uo + ui]) > 0 
according to line 4, while in line 7 we use a truncated power series (or binomial series) approxima- 


tion./?19] 


[2171 Tf the roots of a cubic polynomial in x (with a leading x term) are 21, £2,23 then the polynomial can be factorized as 


(a — 21) (w@ — 2) (w@ — 23). 


[218] For a cubic of the form x3 + ba? + cx + d with roots x1, 72,23 we have 21 +22 +23 = —b. 
[219] We have: 
1/2 1 32 5 3 
(1— 2) =1l+-@+—-"° + —2°+--- (-l<a2<1) 
2 8 16 
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Now, our objective from developing a general relativistic formula for the extra precession of planetary 
perihelion is to find the extra change in ¢ caused by a change of u during a complete revolution of the 
planet around the Sun. In practical terms, we need first to find A@ which represents the total change 
in ¢ during a full revolution (i.e. when the orbit changes from uo to uy then back to ug) and then 
we find 6¢ which is the extra change in ¢ (i.e. the excess over 27) and this d¢ is the required extra 
precession. So, we should first find Ad when the orbit changes from up to uw; then back to uo (ie. 
Up + U, — Ug) by integrating |d¢], that is: 


= cg ft Lt BS lw to tu] 
Ke / Pree af ae (291) 


where in the second step we used Eq. 290 plus the presumed symmetry of the orbit.22°! To evaluate 
the integral in the last equation we need to find an expression for u which means that we need to have 
a solution for the planetary motion problem where u is given as a function of ¢.1??!] Now, in § 5.2 
(see problem 1) we demonstrated that in the classical limit the general relativistic formulation for the 
planetary motion reduces to the classical system of Eqs. 15-16, while in § 1.5.1 (see problem 2 and Eq. 
17) we found that the solution of the classical system of Eqs. 15-16 is: 


1 e/A Be? 
a = 


uw ecos@+1 GM (ecos¢ + 1) 


where A is a positive constant and e = ae Now, since in the present problem we want to derive the 


general relativistic formula for the extra precession of planetary perihelion in the classical limit for the 
purpose of comparison to observation (since our observations of planetary motion fall well within the 
classical limit) then we can use this solution as a good approximation. Also, in § 1.5.1 (see problem 
3) we found that: 


1 B? aaa 1 B? 
r = n n= = 
on up) GM (1 +e) oT Gi CMTS) 
Accordingly: 
GM GM |GM (ecos¢+1 GM (1+e GM (1l-e 
1+—y lutu tml = l+—3 | (eon dy o dy c ) 
G? M? 
= 1+ apr (ecos¢ + 3) 
GM (ecos@+ 1 GMe,. 
jau) = fa (eer) | — SO sin at 
GM(1l+e) GM(ecos@+1) GMe(1—cos¢) 
Ug B2 B2 = B2 
GM (ecosé6+1) GM(1l—e) GMe(1+4+cos¢) 
Ne B Be B 
GMe -——~— GMe,. 
V (uo “> u) (u —_ u1) = B2 Gee? ? = B2 sin d| 


[220] Although the orbit is not elliptic (because it is not closed) it can still be symmetric (considering a full revolution) or at 
least it is almost symmetric (if it is not exactly symmetric). 
[221] Tn fact, we also need to find expressions for ug and uj, (but they are based on the expression of wu). 
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On substituting from the last equations into Eq. 291 we get:!???! 


=m oe (ecos¢ +3) GMe 
2 GMe |; B2 
=0 Bz [sin | 


l2 


Ag lsin ¢| do (292) 


o=n G2 M2 


IG?M7e.. 1, G?M? 
= 7 + ga Be [sin ¢]5 + “2B °™ 
61G? M? 
c? B 
67G? M? 
c? B2 
= 2r+d¢ 


where in line 1 the use of dd (instead of |d¢|) is justified by the limits. As we see, during a complete 
revolution of the planet around the Sun (i.e. when the orbiting planet undergoes a change from a 


perihelion rp to an aphelion r; then back to the next perihelion rg and hence it completes a full 
6nG? M? 
eB 


= 274+0+ 


= w+ 


revolution) there is an extra precession 6¢@ in the planetary orbit of 


dA tab. 2na?/1 — e? 
dt T Ali 


where we used formulae that we developed earlier (see § 1.5.1). Therefore, the extra precession in the 
planetary orbit in each revolution is: 
fie 61G? M? $3 61G? M? 7 61G? M?T? 3G? M?T? 


c? B? 2 (er ee), c?4n2a4 (1 — e?) ~ Oat (1 — e2) 
zie 


. Now, we have: 


which is the same as Eq. 285. 
Regarding the observed extra precession of perihelion of Mercury, the time period T of Mercury is 
about 88 days (i.e. about 7.6032 x 10° s), its semi-major axis a is about 5.790905 x 10!° m, and its 
eccentricity e is about 0.2056. Hence, the extra precession of its perihelion in one revolution d6¢ 4 is: 
3 x (6.674 x 10-!)* x (1.989 x 103°)” x (7.6032 x 108)? 
bom = 2 4 
2m x (3 x 108)~ x (5.790905 x 101°)” x (1 — 0.20567) 


Now, in one century (i.e. 100 years) there are 36525 days and hence Mercury should revolve around 
the Sun about 415 times. Therefore, the extra precession of its perihelion in one century is about: 


415 x 5.018 x 107’ rad © 0.0002083 rad ~ 42.95 arcsec 


~ 5.018 x 107’ rad 


The observed extra precession of perihelion of Mercury is about 43 arcseconds per century (with 
a reported uncertainty of about 1%). Hence, the agreement between the observed value and the 
prediction of general relativity is excellent (but see the following note and § 9.1). 

Note: in the above derivation and calculation we used the formula: 


dA mab = 2ma?v/1 — e? 

dt T T 
In fact, this formula is valid for flat classical spacetimel??3l (which is not the case in general relativity) 
and this will put a big question mark on the validity of the above derivation and calculation. Accord- 


ingly, the “excellent” agreement between the observation and general relativistic prediction is highly 


[222] Regarding the change of the limits of integration, refer to the note in the end of this answer. 

[223] This may be seen more prominently in the final formula where we use a (which is a physical distance in flat space) 
rather than a quantity like r (which is a coordinate variable that usually appears in general relativistic formulae in 
curved spacetime). In fact, all the quantities A, t,a,b,T as well as the relation A = zab are classical and belong to a 
flat space-time (as can be noticed from importing them from the classical formulation of § 1.5.1). 
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suspicious.|??4] In fact, if we take the effect of the curvature of spacetime (by observing the difference 
between coordinate variables and physical variables) then there should be no agreement (let alone be 
excellent). This issue will be investigated further in § 9.1 where we will clarify the issue that being in 
the classical limit (as demanded by the question and followed in the derivation) does not justify the 
neglect of the effect of spacetime curvature due to the violation of locality and the accumulation of 
tiny relativistic effects over extended tempo-spatial interval (see exercise 8). 

We should also note that in changing the limits of integration between Eq. 291 and Eq. 292 we 
are also using an approximation as if there is no extra precession.!??5]_ This could be an acceptable 
approximation for one revolution but it may not be acceptable for many revolutions when we calculate 
the extra precession in a century. This should be a cause for more uncertainty about the “excellence” 
of the agreement between the observation and general relativistic prediction. 


Exercises 


1. What “perihelion” and “aphelion” mean? 

2. What “precession” means? Is it appropriate term for describing the slow shift in the position of 
perihelion? 

3. What we mean by “extra precession”? 

4. What are the main causes of the precession of perihelion of Mercury that can be accounted for by 
classical physics?|??6) 

5. What distinguishes Mercury from other planets in the solar system and hence makes it prominent with 
regard to its extra precession of perihelion? 

6. What you note about 6¢, d¢, and N, in Table 1? 

Illustrate the precession of perihelion by a simple sketch. 

8. As we noted earlier (see § 1.1), the extra precession of perihelion of Mercury may be seen as an exception 
to the common rule that the characteristic general relativistic effects that distinguish general relativity 
from classical gravity are generally observable in strong gravitational fields. Discuss this issue. 


N 
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Light bending (or light deflection) by the gravitational field of a gravitating body, like the Sun, is one of 
the major predictions of general relativity because according to this theory the bending of spacetime by 
matter and energy should result in bending the geodesic paths of the spacetime and hence the physical 
objects, whether massive like material particles or massless like photons, which follow these geodesic paths 
in their free fall (i.e. in the absence of forces other than gravity) should move along curved trajectories. 
The bending of light in gravitational fields may also be justified qualitatively by the equivalence principle 
since light bends in accelerating frames and hence it should also bend in gravitational frames. In fact, 
light bending by gravity can be derived even from the Newtonian theory (see Exercises) and hence it is 
not an effect that is specific to general relativity. However, the Newtonian prediction is quantitatively 
different from the prediction of general relativity.|??7! 

Using the geodesic equations of null geodesic in the Schwarzschild metric, it can be shown (see Problems) 
that the deflection of light by the gravity of a static and spherically symmetric gravitating body is given 
by: 


(293) 


where 6¢ is the angle of deflection in radians, G is the gravitational constant, M is the mass of the 
gravitating body, c is the characteristic speed of light and d is the distance of closest approach of light to 


[224] The suspicion is actually not about the agreement but about the general relativistic nature of the prediction. 

[225] Noting that the total extra precession in a revolution is an accumulative contribution of tiny precessions over the entire 
orbit, this approximation should be acceptable considering that 5¢ is very tiny compared to full revolution (i.e. 27). 

[226] As indicated earlier, we are excluding here classical theories that claim to account for the extra precession. 

[227] We are referring here to the commonly recognized Newtonian prediction which is based on derivations like the one that 
we provided in the Exercises. However, some theories based on the Newtonian gravity seem to have different predictions; 
some of which may agree with the prediction of general relativity. 
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Apparent position 


Figure 5: A simple sketch depicting the bending of a light ray from a star as it passes by the Sun toward 
the Earth and hence it is deflected by the gravity of the Sun. 


the center of the gravitating body. 

Light bending by the gravitational field of a gravitating body may be demonstrated by a number of 
astronomical phenomena. For example, it is demonstrated in the bending of light rays originating from 
stars by the gravitational field of the Sun which results in the apparent displacement of the stars from 
their real position (see Figure 5). Because the faint light of stars cannot be observed in the close proximity 
of the bright light of the Sun, these light bending observations are usually conducted during solar eclipse 
where the bright light of the Sun is masked by the Moon. A historical example for this sort of light 
bending observations is the famous 1919 expedition by Eddington and his team which led to the general 
acceptance and dominance of general relativity and the rise of Einstein to worldwide fame. 

Light bending by the gravitational field of a gravitating body may also be demonstrated by a number of 
astronomical phenomena that are generically labeled as gravitational lensing where a gravitating object 
(such as a black hole or a galaxy or a cluster of galaxies) acts like a lens by gathering light rays from an 
observed object along multiple paths around the gravitating (or lensing) object and directing them toward 
the observer and hence the observed object appears with multiple images surrounding the gravitating 
object. For instance, it may be demonstrated by the so-called Einstein cross where 4 images of an 
astronomical object, such as a quasar, are formed by the deflection of the light rays that originate from 
the object and follow 4 different paths in their way around the gravitating object. Gravitational lensing 
may also be demonstrated by the so-called Einstein ring which is similar to the Einstein cross but instead 
of forming 4 separate images a ring is formed by a continuous bundle of rays that are symmetrically 
distributed around the lensing object. Einstein ring is supposed to be formed when the observer, observed 
object and the lensing object are aligned along a straight line and hence the coming rays form a circular 
symmetry around the lens, while Einstein cross is supposed to be formed when they are not aligned. 
However, the formation of a cross (rather than other shapes) requires further justification (see § 9.2). 


Problems 


1. Derive the general relativistic formula for light bending by stars (e.g. Sun). 

Answer: The spacetime surrounding a star (as a gravitating body) is reasonably described by the 
Schwarzschild metric and hence we should use this metric here. Moreover, the problem of light bending 
is a problem of motion along geodesic trajectories in spacetime because the light is in free fall under 
the influence of gravity alone. Therefore, this problem should be investigated as a geodesic problem 
in the Schwarzschild metric and hence we can use the results that we obtained in § 4.1.2. Now, since 
light is massless the system of geodesic equations that is suitable for investigating this problem is the 
system of Eqs. 164-166. In fact, all we need from that system is Eq. 165 because the essence of light 
bending is represented by a correlation between r (= 1/u) and ¢ (i.e. how ¢ changes as r changes) 
and that is what Eq. 165 is about. So, our investigation here starts from Eq. 165, that is: 


d?u 3G Mu? 
tthe (294) 
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Figure 6: A simple sketch depicting the deflection of a light ray LR emitted from a light source LS as the 
ray passes by a gravitating body GB (which is at the origin of coordinates) toward a light detector LD 
and hence the light ray is deflected by an angle 6¢ causing the image of LS to be visually displaced and 
thus it is seen at an apparent location AL. The other symbols in the sketch are explained in the text. 


In brief, what we need for finding a formula for light bending by gravity is to solve Eq. 294 and hence 
infer the angular deflection in the path of light that is caused by the gravitational field of the star. 
Now, if we consider a light ray grazing a typical star that represents a gravitating object then we have 
u> ae This can be verified by taking the ratio of seu to u and using the Sun to represent 
the typical star, that is: 


8GMuw2 — 3GMu 
2 


38GM 3x 6.674 x 10-™ x 1.989 x 10°° 


ae : ~ 6.362 x 107° 
Ge (3 x 108)” x 6.9551 x 108 


c2u Cc 


where r represents the radius of the Sun since the ray is grazing (in fact r should be radial coordinate). 
3GMu? 
c2 


Therefore, as a first approximation the term can be ignored in solving Eq. 294 and hence we 


solve the equation oe +u =0 instead. The solution of the latter equation is (see Exercises): 


1 cos¢ T T 
- - <<) 295 
Gaca oo 

where d is a constant. As we will see, d represents the distance of closest approach of the light ray to 
the center of gravitating object and hence in the case of grazing ray it represents the radius (or radial 
coordinate) of the gravitating star. We should note that the solution can also be given as u = sine 
with 0 < ¢ <7. In fact, for a polar coordinate system in a standard form (i.e. where ¢ is the angle 


that re, makes with the positive x axis of the corresponding Cartesian system) u = cose represents 


a straight line parallel to the y-axis and intercepting the x axis at d (see Figure 6) while u = sine 
represents a straight line parallel to the z-axis and intercepting the y axis at d. 

Now, Eq. 295 is an equation of a straight line in polar coordinates (see Exercises) with d being 
the perpendicular distance of the straight line from the origin of coordinates where the center of the 
gravitating object is located (see Figure 6) and hence our first approximation produces no deflection. 
However, we can seek a better solution by substituting this approximate solution into the u? term of 
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Eq. 294, that is: 


It can be easily verified (see Exercises) that a solution of the last equation is: 


cos b " GM (2 — cos? ¢) 
d cd? 


Now, to find the total deflection of the light ray during its journey from infinity (i.e. r > co) on one 
end to infinity on the other end (i.e. from u = 0 at the source of light to u = 0 at the detector of light), 
we set u = 0 in the last equation and find the difference between ¢ at one end and ¢ at the other end, 
that is: 


cos@¢ GM (2 — cos? d) 


0 
d cd? 
cosé | 2GM — GM cos” é 
d ° ¢d? cd? 
G 2GM 
ad cos? @ — cos ¢ ag = 0 (296) 
Now, if we note that fou cos? @ is very small for a light ray grazing a typical star (see Exercises) then 
we can ignore this term and hence we get an approximate equation: — cos ¢— 2 ~ 0 whose solution 
is: 
2GM 
Au os 297 
cos b 2d (297) 
and hence: oCM OGM 
T T 
~ = d ~~ 298 
o1 5 ay an 2 5 ala (298) 


These two possible values of ¢ correspond to the two ends of the light ray (i.e. source of light at 


infinity on one end and detector of light at infinity on the other end). In fact, the value ¢2 ~ 5 — ao 
corresponds to @ ~ —5 — aa or similarly to ¢ ~ 37 — el (see Figure 6) and that is why it does 
not give the correct sign. More justification is given in the upcoming note 2 in the end of this answer. 
Now, if we take the difference between @ at infinity on one end where the light source is located and 
@ at infinity on the other end where the light detector is located (see Figure 6) then we get the total 


deflection, that is: 


am 22GM at 2GM 4GM 
ere Ta rae cd E cd ) cd 


which is the general relativistic formula that is given in the text (see Eq. 293). 

Note 1: the reader may feel uncomfortable about the last approximation where we discarded the 
term ou cos? ¢. So, we show in this note that this approximation is very good. Eq. 296 (which can 
be written as GM cos? ¢ — c?dcos¢ — 2GM = 0) is a quadratic equation in cos ¢ and hence from the 
quadratic formula we get: 


edt Vad + 8G2 M2 
IGM 


It is obvious that the root with plus sign is unacceptable because it is too large and hence it should be 
rejected since —1 < cos¢@ < +1. Now, if we consider a grazing light ray and use the Sun as a typical 
star then from the approximate root (i.e. Eq. 297) we get: 


cos @ = 


(299) 


2GM 2x 6.674 x 107" x 1.989 x 10% 
ed — (3 x 108)* x 6.9551 x 108 


cos ¢ ~ ~ —4,2413596 x 107° 
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while from the exact root (i.e. Eq. 299 with the minus sign) we get: 


Cd— Vd? + 8G2 M2 x 28 
cos @ = 2GM ~ —4,2413599 x 10 

As we see, the approximate root is virtually identical to the exact root. 

Note 2: the transition from Eq. 297 to Eq. 298 may not be obvious. However, we should note first 
that we have two different solutions because the original equation is quadratic and although we already 
rejected one root on mathematical basis we should admit the existence of two physical roots. In fact, 
we can demonstrate formally that ¢, and 2 of Eq. 298 are the principal solutions, but instead we 
can follow a simpler approach by using the trigonometric identity cos (a + 3) = cosacos BF sin asin 3 
with the sine approximation of small angle (i.e. sin@ ~ () to conclude that ¢) and ¢2 in Eq. 298 
produce Eq. 297 (and hence ¢; and ¢ are solutions to Eq. 297), that is: 


tT 2GM T 2GM _ mw, 24M 2GM 
cos¢@,; = cos (3 + “) = COs 5 cos — 5 sin > sin 37 =~ 2a 

38x 2GM 37 2GM . 38a , 2GM 2GM 
COS 2 = Cos (= = ) = COS 2 Cos ad + sin sin ed & ad 


where we used the aforementioned alternative expression for ¢2 (to obtain the correct sign) and where 
the smallness of St (and hence 2S“) is justified in exercise 7. 


Note 3: the provision u > salir can be justified more generally by the following: 


3GMu2 _3GMu _3GM _32GM__ 3Rs 


2 er 2 er 2r 


c2u Cc 


where Rg is the Schwarzschild radius and r represents the star radius (or radial coordinate) or larger. 
Now, for ordinary stars Rg < r and hence the provision is justified. The reader should note that the 
“erazing” condition that associates this provision (and its alike) in the above derivation represents the 
worst case and hence the provision becomes stronger in the non-grazing case. 

Note 4: the validity of the above derivation should be restricted to the weak gravitational field 
case where the spacetime is approximately flat. This may be suggested by a number of indications 
such as defining d as “distance”, using polar coordinates (also see exercise 5) and accepting straight line 


solution as a first approximation (which is based on u >> sore and the smallness of eM). In fact, this 
restriction should be suggested by the question itself since “stars” usually generate weak gravitational 
fields (if we exclude neutron stars). This is also suggested by the use of terms like “typical star” and 
“ordinary star” (as well as other indications in the solutions of the upcoming Exercises). 

Note 5: the above derivation is based on a number of approximations and compromising assumptions 
(e.g. classical gravitational systems of weak gravity). Accordingly, the derived formula should have 
limited validity from these perspectives (at least as far as this derivation is concerned). 

Note 6: we defined d as the distance of closest approach. But considering the initial straight-line 
solution, it should be more appropriate to define it as the distance of impact (or impact parameter) 
where the supposed straight line is defined by the initial direction of the ray. However, we adopted 
the “distance of closest approach” definition for convenience in the derivation and application (as well 
as possible ease of comparison to the upcoming classical derivation; see exercise 8). Anyway, for 
the considered weak gravitational field case (where the deflection of light is very tiny) the difference 
between the two definitions should be practically negligible. 


Exercises 


Li 
2: 


Is light bending by gravity restricted to light (i.e. photons)? 

Discuss the following statement: “The bending of light in gravitational fields may also be justified 
qualitatively by the equivalence principle since light bends in accelerating frames and hence it should 
also bend in gravitational frames”. 

What is the significance of light bending by gravity? 
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4. Show that u = cond is a solution to the equation a +u= 0. 


17. 


Verify that (a) the polar equation 4 = cone (with —§ << 4) is an equation of a straight line parallel 
to the y axis, and (b) the polar equation 1 = sine (with 0 < ¢ < m) is an equation of a straight line 
parallel to the x axis. 
Verify that: 

cos@¢ GM (2 — cos? o) 


ust ae (300) 


is a solution to the following equation: 


dtu 3GM cos? ¢ 
ape aces (301) 
GM 


ede 
Show that for a light ray grazing a typical star, 737 cos? # is very small (i.e. close to zero). 

Derive a classical formula for light bending by stars (e.g. Sun) and hence show that the classical 
deflection is half the general relativistic deflection. 

Since light is massless and classical gravity is a property of mass (according to Newton’s gravity law), 
how can light be deflected by gravity in classical physics? 


. Calculate the deflection of a light ray grazing the Sun. 
. What is the significance of the apparent displacement of the image of an observed object when the 


light originating from the object bends? 


. Do you think “gravitational lensing” is a suitable term for the described phenomenon? 

. Make a simple sketch depicting gravitational lensing. 

. What can gravitational lensing reveal about the lensing object and the lensed (or observed) object? 

. Show formally that the deflection of a gravitational lens decreases with the increase of the distance of 


closest approach of the deflected ray. 


. Show that the deflection of light by gravity can be considerable only if the gravitating object is compact 


(e.g. black hole). 
Show that in gravitational lensing the lensed object cannot be observed when the distance of the 
observer from the lens is less than a certain amount, and give an estimate of this amount. 


8.3. Gravitational Time Dilation 


The essence of gravitational time dilation in simple terms is that clocks slow down as they descend in a 
gravitational well and speed up as they ascend in a gravitational well. In fact, gravitational time dilation 
(assuming a Schwarzschild spacetime) was investigated rather thoroughly in § 6.3.3 and hence we do not 
repeat the investigation here. 


Problems 


ul 


Derive a formula for the relation between proper time intervals in a Schwarzschild spacetime as recorded 
by two local stationary frames that are at different radial distances from the source of gravity. How 
this formula becomes when one frame is at infinity, i.e. very far away from the source of gravity? 
Answer: If dr, and dt» are the proper time intervals recorded by frame O, at r; and frame O2 at r2 
(where r; and rz are the radial distances from a source of gravity of mass M and r, and rz are greater 
than 24") then from Eq. 245 we have: 


1/2 -1/2 
dry _ ( a (1- 26M) ae 


Cre 


Now, if Og is at infinity (ie. rg + oo) then this formula becomes: 


(303) 


Cry 


-1/2 
ATs, = dt. = dr (1 — | 
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i.e. the proper time interval dz. that is recorded in the absence of gravity (i.e. at infinity) is longer 
than the proper time interval dr, that is recorded in the presence of gravity, i.e. at finite radial distance 
r;.?28] In other words, the clock at infinity runs faster than the other clock which means that the 
effect of gravity is to slow down clocks. 
Note: r; and r2 in the formula represent radial coordinates but in the answer we assumed that they 
represent radial distances (as required by the question). So, if we have to be consistent then we should 
obtain the radial coordinates that correspond to the radial distances (which are supposedly available 
from the physical setting according to the question) and use these radial coordinates in the formula 
(assuming that obtaining the radial coordinates from the available radial distances is possible). This 
should highlight the practical difficulties that are usually faced in solving general relativistic problems 
that involve coordinate variables (whereas in reality we have physical variables). In fact, we can easily 
avoid this difficulty by changing the question to: “at different radial coordinates” instead of “at different 
radial distances” but this will not address the essence of this difficulty where we can legitimately ask 
about problems that involve physical variables (which what we usually have and where the questions 
about them is usually more natural). 

2. What is the fractional change in the time interval that is experienced by a clock as it moves!??9! from 
the surface of the Earth to very far away from the Earth (i.e. infinity)? 
Answer: Assuming a Schwarzschild spacetime (see the upcoming note 3), the fractional change is 
given by: 


dm-dm _ ,_ 4 
dt. - dt2 
1/2 
= 1-1-5") 
Cory 


l2 


1/2 
1 _ (1 2% 8.874 x 107" x 5.972 x 1074 ‘ 
(3 x 108)” x 6.371 x 106 


~ 6.951 x 107° 


where in line 2 we use Eq. 303, and in line 3 we use the mass of the Earth for M and the radius of 
the Earth for r,. 

Note 1: in this answer we used the radius of the Earth for r;. This seems inconsistent with the fact 
that the radius of the Earth is a physical radius while r; is a coordinate radius. The justification of 
this according to some general relativists is that the radius of a circle is its circumference divided by 
27. So, let use this definition of radius to find the coordinate radius r that corresponds to the physical 
radius of the Earth R in the Schwarzschild metric. Now, the radius of a great circle of a sphere (which 
represents the Earth) is the same as the radius of the sphere. We can use (with no loss of generality) 
the equator circle (which corresponds to 0 = 7/2 in the Schwarzschild coordinates) in this calculation. 
On the equator circle the ct, r and 6 coordinates are constants (because in measuring length ct is fixed, 
r is the radius of a circle and 6 = 7/2) and hence dt = dr = dO = 0. Accordingly, on the equator circle 
the Schwarzschild line element becomes: 


ds =rsin 5 dé = rdé 


where ds is an infinitesimal line element on the physical circumference and r is the coordinate radius. 
The physical circumference s is then obtained by integrating ds from ¢ = 0 to ¢ = 27, that is: 


20 
a=i r do = 2ar 
0 


1/2 
[228] For finite r and 26M < 1 we have (1 — 2G) Su: 


crr 
[229] We note that “moves” in this question and its alike means change of its location with no consideration of any kinematical 
effects. 
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Now, if we note that the physical circumference is 27 times the physical radius (i.e. s = 27R) then 
from the last equation we get 27R = 2mr and hence R = r, i.e. the physical radius is equal to the 
coordinate radius. 

We note that this general relativistic argument is crucially based on the premise: “the physical circum- 
ference is 27 times the physical radius (i.e. s = 27R)” which is valid in flat space but not in curved 
space. As discussed earlier (see § 2.3), in curved space the relation between the circumference C of a 
circle and its diameter D is C 4 1D while in the above argument we have C = 7D (where C and D 
refer to the metrical quantities). So in brief, the above definition of radius (i.e. the radius of a circle is 
its circumference divided by 27) is incorrect (if radius means physical radius) since it holds only in flat 
space. The physical radius should therefore be obtained by integrating the line element ds along the 
radial direction (i.e. where only dr changes in the metric).!?3°! In fact, the above general relativistic 
argument can be easily refuted (with no need for any detailed considerations) by the obvious fact that 
in curved space the radial coordinate differential dr is not equal to the radial proper length differential 
ds (as can be seen from Eq. 256) and hence the radial distance (or physical radius) cannot be equal 
to the radial coordinate (or coordinate radius). 

We should note that in the literature of general relativity there are similar arguments to the above 
argument; some of which are more elaborate and may appear technically more sound than the above 
argument but none of these are sufficiently convincing and can overcome this difficulty. Moreover, this 
problem is not restricted to the radial coordinate r or to the Schwarzschild geometry and hence even 
if we find a solution to this problem for r in the Schwarzschild geometry there is no guarantee that we 
can find a solution for other coordinate variables and in other spacetime geometries in all cases and 
circumstances. 

We should finally remark that in the literature of general relativity some authors distinguish between 
radial coordinate and radial distance quantitatively (i.e. they treat them as unequal) while other au- 
thors treat them as quantitatively equal. Moreover, some authors claim that the radial distance of a 
point from the origin is not defined.!?3!! So, we have at least three opinions about this issue. 

Note 2: the above practical problem about solving problems involving coordinate variables (which 
are usually unknown) may be eased in the case of weak gravity (as it is the case in classical sys- 
tems) where the difference between the physical and coordinate variables is usually negligible (since 
the spacetime is essentially flat) and hence we can use the known physical variables in place of the 
unknown coordinate variables. However, this does not address this problem completely because this is 
limited to weak-gravity systems and hence we still face this problem in strong-gravity systems where 
the physical and coordinate variables differ significantly. Moreover, even in the weak-gravity systems 
the effect of the difference between the physical and coordinate variables can become significant by 
accumulation over extended space or/and time period due to the violation of locality as seen in the 
case of perihelion precession (see the note of problem 1 of § 8.1 and refer to § 9.1). Furthermore, the 
convergence of coordinate variables to physical variables in weak-gravity systems may apply in the 
Schwarzschild coordinates but not necessarily in other coordinates where the relation between the two 
sets of variables can be more complicated. Anyway, this may be acceptable as a practical solution but 
it cannot be regarded as a valid theoretical solution that addresses this problem fundamentally and 
from its roots. 

Note 3: the use of the Schwarzschild metric in this question and its alike may be justified by being a 
good approximation or by being an idealization. Otherwise, the Earth is not a Schwarzschild object 
due for example to its rotation around its axis (as well as revolution around the Sun) and its lack 
of spherical symmetry. Moreover, the spacetime surrounding the Earth is not really a Schwarzschild 
spacetime due for example to the existence of other gravitating objects like the Sun and other planets. 
Also, the surrounding space is not really empty (as required by the Schwarzschild metric as a vacuum 


[230] We note that calculating such an integration is not viable in practical situations where mass varies along the radial 
direction especially if the body is not spherically symmetric. Yes, in the case of black holes the situation is eased by 
the assumption of singularity. 

[231] Although this seems to be an attempt to address this problem, it actually worsens it because if it is not defined then 
we cannot solve problems that involve radial distance from the origin. 
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solution) due to the existence of matter and electromagnetic energy (although in very tiny quantities). 


8.4 Gravitational Frequency Shift 


Gravitational frequency shift means that a light signal ascending in a gravitational well will be red shifted 
while a light signal descending in a gravitational well will be blue shifted. Referring to our discussion in 
§ 6.3.4, this effect according to general relativity is based on the effect of gravitational time dilation due 
to the reciprocal relation between periodic time and frequency. Gravitational frequency shift may also 
be explained classically (at least qualitatively) by the principle of energy conservation because a photon 
of light ascending in a gravitational well will increase its potential energy and hence this gain in energy 
should be compensated by a red shift, while a photon descending into a gravitational well will decrease its 
potential energy and hence this loss in energy should be compensated by a blue shift.?°7! Accordingly, this 
effect can be predicted in principle even by classical physics. As indicated above, gravitational frequency 
shift was investigated earlier (from a general relativistic viewpoint assuming a Schwarzschild spacetime) 
in § 6.3.4 and hence we do not repeat here. 

We should remark that if gravitational frequency shift is based on gravitational time dilation (as it 
is the case in general relativity) then a single explanation (i.e. gravitational time dilation) applies to 
both time dilation and frequency shift. However, we may need two different explanations (i.e. one for 
time dilation and one for frequency shift) if we used for example the conservation of energy (following a 
classical approach) to explain gravitational frequency shift because the conservation of energy may not 
be able to explain the observations that are specifically related to time dilation, e.g. when we compare 
two clocks at two different levels in a gravitational potential well and hence there is no frequency shift of 
radiation (because there is no descent or ascent of radiation in a gravitational well) to be explained by 
energy conservation. This issue will be investigated further in the following paragraphs. 

Let have two observers A and B who have identical sources of monochromatic radiation, i.e. when A and 
B are at the same location in the spacetime the frequency of the source of A is identical to the frequency 
of the source of B. Also, let assume that A is located at a higher potential in a gravitational well than B. 
Now, we have two physical situations that need to be investigated. The first is when each one of A and 
B observes his own source without sending signal to the other although they can compare the frequencies 
of their sources (e.g. by using the frequency as a basis for a clock that can count and store time). The 
second is when A and B exchange signals from their sources (i.e. A sends signal to B and B sends signal 
to A) and hence they can compare the frequencies of their sources directly each at his location. Now, 
instead of starting from theoretical analysis let start from observations where it is claimed that in the first 
situation the clock of A runs faster than the clock of B (and hence the frequency of A source is higher 
than the frequency of B source), while in the second situation A will observe the signal of B red shifted 
and B will observe the signal of A blue shifted. In the following paragraphs we will try to analyze and 
explain these observations general relativistically and classically. 

According to general relativity, time runs faster at the location of A and hence both situations can 
be explained by gravitational time dilation. As for the first situation, since time runs faster at the location 
of A then the clock of A should run faster and this means that his source of radiation (which is the basis 
of his clock) should have higher frequency compared to the frequency of the source of B. As for the second 
situation, since time runs faster at the location of A then the unit of time of A should be shorter than 
the unit of time of B and hence if the frequency of the identical sources at A and B is calibrated by the 
units of time at a given location (i.e. at A or at B) then it will be higher at B. For example, if we have 10 
cycles of radiation that descended from A to B where the size of the unit of time at B is twice the size of 
the unit of time at A (due to time dilation) then these 10 cycles (which represent 10 units of time at their 
location of emission at A) will be seen at B to occur in a time interval that is half their time interval at 
A (i.e. they occur in 5 time units of B since they correspond to 5 cycles of B) and hence their frequency 


[232] We note that according to Planck’s relation E = hv the photon kinetic energy FE is proportional to its frequency v and 
hence red shift means decrease in kinetic energy while blue shift means increase in kinetic energy. The involvement of 
Planck’s relation should make the explanation partially non-classic although it is classic from a gravitational perspective. 
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at B will be seen as twice their frequency at A. Accordingly, we can claim that gravitational time dilation 
(according to general relativity) provides a single and consistent explanation for both situations. 

According to classical physics, although the second situation can be explained by energy conservation 
(see Exercises), it is not obvious that the first situation can also be explained by energy conservation unless 
we assume that the difference in energy (due to the difference in frequency) at the two locations represents 
an energy loss in or by the gravitational field (and hence if the two sources emit the same amount of energy 
at the two locations then there should be an amount of energy stored in the field; otherwise the amount of 
energy emitted by the sources should depend on the strength of the gravitational field at their locations and 
hence source A emits more energy than source B although this may not be based on energy conservation). 
To be more clear we can say: if we have to follow the reported observations then identical radiation 
sources will emit radiations at different frequencies at A and B. Hence, the energy of the radiation at 
B will be lower than that at A. As the radiation emitted at A descends to B its potential energy will 
decrease and hence its frequency will increase further (i.e. by energy conservation apart from the increase 
by the emission process at A as compared to the emission process at B). Accordingly, the upcoming 
derivation (see Exercises) that is based on the conservation of energy (of descending/ascending signal) 
should account only for the part of the change of frequency that is caused by the descent /ascent process. 
The significance of this is that we may find classical derivations that may agree with the general relativistic 
predictions in general when we account for all the causes of change of frequency and energy (i.e. it is 
possible that the quantitative difference between the predictions of general relativity and classical gravity 
in some cases is because the conservation of energy can partly explain gravitational frequency shift). The 
interpretation of the change of frequency at the emission process will then require the assumption that 
the emission process is affected by the gravitational field such that the energy at emission is a function of 
the gravitational field. Anyway, this classical explanation of the first situation (even if it is assumed to be 
rational) seems less convincing than the general relativistic explanation. Nevertheless, we will continue 
to consider the classical explanations (like energy conservation) of the phenomena that are commonly 
classified as gravitational time dilation as viable explanations at least in principle and in some cases and 
instances (e.g. second situation).|?3%) 

We should finally note that considering the close relation between energy and time (e.g. energy conser- 
vation is a consequence of the homogeneity of time, and energy divided by c is the temporal component 
of 4-momentum), the classical explanation of gravitational frequency shift by energy conservation and 
the general relativistic explanation by time dilation may originate from the same principle and this could 
partly explain the closeness of the predictions of these two explanations at least in some physical systems 
and circumstances. In fact, even the Planck relation E = hy can be added to the examples of the close 
relation between energy and time since the Planck relation is essentially a relation between energy and 
reciprocal time (i.e. frequency) and hence we may need to look for a more fundamental principle that 
may explain not only the close relation between time dilation and energy conservation but even physical 
relations, like the Planck relation, that correlate energy to time or frequency (also see Problems). 


Problems 


1. Discuss the above general relativistic analysis and its implication on energy conservation. 
Answer: As explained above, the source of A is at a higher level in the gravitational well than the 
source of B. Now, according to general relativity the time at B runs slower than the time at A and hence 
identical sources at A and B will be seen by A and B to have identical frequencies and identical energies 
when each frequency is observed in its location of emission because the units of time at A and B are 
different, i.e. each one calibrates his observations by his own time unit. So, to compare the frequencies 
of A and B meaningfully the signal of B should ascend to A (or the signal of A should descend to B), 
i.e. the signal at one location should be moved to the other location so that we can compare the two 
signals sensibly at the same location since the time unit is unique at any specific location. The logical 
scenario in this case is that the signal of B will be seen at A to have lower frequency and energy (or 
the signal of A will be seen at B to have higher frequency and energy) because the unit of time at B is 


[233] The main reason for the insistence on the viability of the classical explanations is that in most cases they produce almost 
identical results to the results of general relativity. 
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larger than the unit of time at A. However, this does not violate the universality of time only but it also 
violates the universality of energy (since the units of energy are dependent on the units of time which 
are position dependent) and hence the conservation of energy at global level will lose its significance. 
In fact, the invalidity of the conservation of energy globally in general relativity can be seen as a logical 
consequence of the invalidity of global time in general relativity since time (which is a basis for the 
quantification of energy as seen for example in the Planck relation or in the correspondence of energy 
to the temporal component of the momentum 4-vector) has only local significance. 

Investigate and assess some of the qualitative arguments for the gravitational frequency shift that 
circulate in the literature of general relativity. 

Answer: One of the common arguments in the literature of general relativity is based on the use of 
the equivalence principle. In brief, a light signal in an accelerating frame moving along the orientation 
of the propagation of signal (ie. in the same direction or in the opposite direction) should experi- 
ence frequency shift, i.e. “expansion” of waves and hence red shift in the case of same direction and 
“compression” of waves and hence blue shift in the case of opposite direction. Hence, according to 
the equivalence principle the same should happen in an equivalent gravitational frame. However, this 
argument may be criticized by the following: 

e The choice of “expansion” and “compression” in the two cases seems arbitrary and the reverse may 
also be claimed and this should spoil the whole argument. In fact, both claims require the presumption 
that light has a proper (or primary) frame and improper (or secondary) frame and this could lead to 
the necessity of assuming an absolute or privileged frame for the propagation of light. The analysis 
may also require considering the fact (which we found in B4 from analyzing the formalism of Lorentz 
mechanics) that the light signal obtains a velocity component from its source although the speed of 
light remains constant. So, if we extend the validity of this finding from inertial frames to non-inertial 
frames then we may say: if the source of light is in the accelerating frame then the compression and 
expansion should follow one of the above patterns while if the source of light is not in the accelerating 
frame then the compression and expansion should follow an opposite pattern (although other patterns 
may also be possible). 

e We may claim that the opposite argument may be more appropriate and logical, that is the frequency 
shift in a gravitational frame is justified first (e.g. by energy conservation or something else), then the 
frequency shift in an equivalent accelerating frame is inferred from the equivalence principle. However, 
this course of action could defeat the original purpose of the equivalence principle as an accessory for 
establishing facts and obtaining results about gravity and its consequences. 


Exercises 


1; 


Derive a classical formula for the gravitational frequency shift and hence compare the classical and 
general relativistic formulae. Also, determine how the classical formula becomes when the observer is 
at infinity. 

Show that the general relativistic formula for the gravitational frequency shift agrees with the clas- 
sical formula (see exercise 1) by using power series approximation with reasonable conditions and 
assumptions. 

Find the fractional change in the frequency of a light signal emitted on the surface of a star (say white 
dwarf) with one solar mass and one Earth radius when it reaches the Earth. Use in your answer both 
the general relativistic and classical formulae and compare the results. 

Write the gravitational frequency shift formula of general relativity and classical physics in terms of 
wavelength. 

Show that if ou <1 then the gravitational frequency shift formula of general relativity converges to 
its classical counterpart at infinity (i.e. rg > 00). 

Use the classical and general relativistic formulations to find the wavelength of a light signal emitted 
at the surface of the Sun with a wavelength 41 = 500 nm when it reaches the Earth. 

Repeat the previous exercise but assume this time that the Sun has the same radius as the Earth. 
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8.5 Gravitational Length Contraction 


The essence of gravitational length contraction in simple terms is that measuring sticks shrink in the 
radial direction as they ascend in a gravitational well and stretch in the radial direction as they descend in 
a gravitational well. In fact, gravitational length contraction (assuming a Schwarzschild spacetime) was 
investigated rather thoroughly in § 6.4.3 and hence we do not repeat the investigation here.|?341 


Problems 


1. There are some opposite physical interpretations in the literature of general relativity about length 
contraction (as if it is length dilation) and its meaning. Discuss this issue. 
Answer: Apart from the possibility in some cases of being misunderstanding or mistake, some of 
these interpretations are based on using different metrics. The above description of gravitational length 
contraction is based on the Schwarzschild metric. The origin of some of these conflicting interpretations 
may also be the difference in perspective (similar to what we find in special relativity about similar 
issues; refer to B4) and the language of presentation. 


Exercises 


1. Derive a general relativistic formula for the relation between proper infinitesimal lengths along the 
radial direction in a Schwarzschild spacetime as obtained in two local stationary frames that are at 
different radial distance from the source of gravity. How this formula becomes when one frame is at 
infinity, i.e. very far away from the source of gravity? 

2. What is the fractional change in the infinitesimal length along the radial direction that is experienced 
by a stick as it moves from the surface of the Earth to very far away from the Earth (i.e. infinity)? 

3. List some issues that can be a source of controversy and conflict in the gravitational length contrac- 
tion. [235] 

4. In § 6.3.4 we derived gravitational frequency shift (see Eq. 251) from gravitational time dilation. Try 
to derive (or rather deduce) gravitational frequency shift from gravitational length contraction. 


8.6 Gravitational Waves 


Gravitational waves is one of the consequences and predictions of general relativity where fluctuations or 
ripples in the geometry of spacetime (i.e. gravitational waves) are generated by the motion of massive 
objects. These ripples are like the ripples generated by the motion of electric charge in the form of 
electromagnetic waves. In brief, as accelerated electric charges emit electromagnetic waves that carry 
energy and momentum, accelerated masses should also emit gravitational waves that carry energy and 
momentum (noting that the Birkhoff theorem puts some restrictions related to symmetry). For example, 
two gravitating bodies (e.g. neutron stars) orbiting around their center of mass should emit gravitational 
waves continuously due to their accelerated motion and hence they continue to lose energy in the form of 
gravitational waves and this leads to gradual contraction of their orbit and potentially eventual collapse 
and merge.!?35l We note that according to general relativity the characteristic speed of gravitational waves 
is the same as the characteristic speed of light c. 

Anyway, the theory of gravitational waves is very lengthy and complex with very trivial practical value 
and outcome and hence it is beyond the scope and objectives of this book. In fact, large parts of the 
gravitational waves theory are beyond the capability of modern science to verify or falsify and hence they 
should be regarded more aptly as mathematical speculations and curiosities rather than physical theories 


[234] We should note that spatial distance (or length) is ambiguous when the metric is time dependent because it will also be 
time dependent due to its dependency on the actual trajectory (or world line) that connects the initial and final points 
in the spacetime. However, this does not apply to the Schwarzschild geometry which is time independent. Anyway, the 
validity of length contraction should not depend on this issue because even in the case of time dependency there could 
still be length contraction (or dilation) as a general feature even if it is time dependent. The same may be said about 
time interval when the metric of spacetime is time dependent. 

[235] This question is general and is not restricted to a specific metric like Schwarzschild. 

[236] Although this scenario applies to any such orbiting system (like the solar system) the rate of energy emission is so low 
on astronomical scale that it takes very long time (cosmological time scale) for these systems to collapse. This should 
explain why these systems are generally stable within the window of observation. 


8.7 Black Holes 175 


(see § 1.9). Therefore, we refer the interested reader to the literature of general relativity for details on 
this subject (although we strongly advise against spending valuable time on this virtually useless theory 
unless there are purely theoretical justifications and motivations). 


Problems 


1. List some of the properties of gravitational waves. 
Answer: According to the literature of general relativity: 
e Gravitational waves are ripples in the geometry of spacetime rather than ripples contained in the 
spacetime (as it is the case for example in electromagnetic waves). 
e They are generated by accelerating massive objects. 
e Like ordinary waves, they are characterized by wavelength and frequency. 
e Like ordinary waves, they carry energy and momentum.|?97] 
e Like electromagnetic waves, they propagate with the speed of light. 

2. Give examples of the physical events that are expected to generate gravitational waves. 
Answer: Examples are: 
e Massive objects (e.g. two stars) orbiting around their center of mass. 
e Collision and merger of massive objects (e.g. two black holes or two galaxies). 
e Explosion in astronomical objects or collapse of stellar cores (like nebulae and supernovae). However, 
for such astronomical events to generate gravitational waves they must be spherically asymmetric 
because otherwise we will have a Schwarzschild spacetime (according to the Birkhoff theorem) which 
is time independent (see § 4.1.1). 
e Rotation of spherically asymmetric compact objects (like white dwarfs). 
e Violent events during the creation of the Universe (i.e. Big Bang). 

3. Should a radially pulsating star emit gravitational waves? 
Answer: No, because according to the Birkhoff theorem as long as the matter distribution of the 
gravitating body is spherically symmetric the geometry of spacetime in the vacuum region is described 
by the Schwarzschild metric even if the matter distribution varies in time. Now, since Schwarzschild 
metric is time independent there should be no disturbance in the spacetime, i.e. there is no emission 
of gravitational waves. 


Exercises 


1. Assess the sensibility and rationality of the existence of gravitational waves. 

2. Assess the commonly accepted claim that the idea of gravitational waves is the brainchild of Einstein. 

3. Why a spherically symmetric gravitating object cannot generate gravitational waves even if it varies 
in time (i.e. while remaining spherically symmetric)? 

4, Justify the fact that gravitational waves are much weaker than electromagnetic waves (where weaker 
and stronger in this context may be quantified generically by the ease of detection or by the amount 
of energy they carry). 

5. Why we might have “gravity of gravity” (according to general relativity or general relativists) but not 
“electromagnetism of electromagnetism”? 

6. Outline the formulation of gravitational waves in general relativity using the linearized form of the 
Field Equation. 


8.7 Black Holes 


We should note first that this section about black holes is very brief (compared to similar sections or 
chapters in other books of general relativity) because the scope of our book is general relativity and hence 
its applications (in black holes and cosmology for example) are not of major interest to us. Moreover, 
we are skeptical about the physical reality of many of those mathematically oriented investigations and 
models of black holes that are based on the formulation of general relativity since any authentic physical 


[237] Whether gravitational energy and momentum can be a source term in the Field Equations seems to be a controversial 
issue (which is a typical state in the relativity theories) although the common opinion among general relativists seems 
to deny this (where it may be justified by their implicit inclusion through the non-linearity of the Field Equations). 
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theory should be primarily driven by observations and experiments rather than by mathematical models. 
However, many important issues about black holes are addressed in the problems and exercises (which 
the reader can skip with no regret and with no loss of continuity) .|?38 

Black holes, which are regarded as one of the dramatic and exotic predictions of general relativity, are 
massive compact objects whose matter is highly concentrated in space (i.e. they have very high density) 
to such a limit that even light cannot escape from the grip of their strong gravitational field.!?3°! This 
means that any light emitted (hypothetically) from a black hole toward outside will bend and return back 
to the black hole and hence no light from the black hole can reach an outside observer and this should 
explain why they are “black”. This is a direct consequence of the distortion of the spacetime caused by 
the dense concentration of matter according to general relativity because the distortion of spacetime in 
the vicinity of a black hole is so dramatic that all the geodesics originating from the black hole bend and 
turn back to the black hole. However, as we will see black holes can also be obtained (at least in principle 
regardless of any particular derivation) from a classical gravitational formulation with no need for this 
geometric paradigm. 

Although it is impossible to observe black holes directly (because they are “black”) black holes can be 
observed in principle indirectly by observing the effect of their gravitational field on neighboring objects 
(such as stars) or by observing energetic emissions (such as X-rays) generated by the accretion of gas, 
dust and debris onto the black holes from their surrounding. Black holes are allegedly detected (firmly or 
tentatively) or contemplated in certain binary orbiting systems and in the center of some galaxies. There 
are also theoretical speculations that they might exist randomly in space as residues from the alleged Big 
Bang and the subsequent processes in the evolution of the Universe. 

It is generally claimed that there are three main types of black hole (although several other types, such 
as intermediate black holes, can also be found in the literature): 

e Super-massive black holes which are supposed to exist in the center of most galaxies especially the large 
ones. 

e Stellar black holes which are supposed to be created by the collapse of massive stars or the merger of 
binary objects. 

e Miniature (or primordial) black holes which are supposed to be created by violent events and inhomo- 
geneities during the Big Bang. 

According to the theoretical models, black holes have very simple “structure” and hence (or because) 
they are characterized by only three mutually independent and externally observable intrinsic physical 
properties: mass, electric charge and angular momentum.!?4°l This may be stated colloquially as: “black 
holes have no hair” (which is commonly known as the “no hair theorem”). Black holes are commonly seen 
as physical singularities in the spacetime where mass accumulates in a single point of space and hence they 
are infinitely dense.!?4!] The singularity of black holes may also be interpreted differently and correlated 
to other parameters (e.g. curvature of spacetime). 

Black holes are also supposed to be surrounded by an imaginary sphere (centered on the singularity) 
called the “event horizon” which is where the escape speed becomes equal to the characteristic speed of 
light c. Accordingly, nothing (including light) can escape from within the event horizon and hence what 
falls inside the event horizon will be trapped there forever (although black holes “evaporate” quantum- 
mechanically but not classically according to some theories; see § 8.7.3). The radius of the event horizon 
is known as the Schwarzschild radius which is given by Rg = 26M where M is the mass of the black hole. 


238] Because the general relativistic literature on black holes and related topics is congested with nonsense, fantasies, ab- 


surdities and paradoxes we strongly advise against spending valuable time on these topics. In fact, our investigation is 
mainly for the purpose of completeness and to meet the demand of some readers (and that is why it is kept brief). 
The compactness and high density of black holes will be discussed later where we will see that they may not apply 
equally to all types of black hole according to certain criteria. 

In fact, angular momentum is not entirely independent of mass although it is independent (due to its dependency on 
other parameters). Also, the condition “intrinsic” is meant to exclude observer-dependent (i.e. extrinsic) properties such 
as velocity and kinetic energy. 

Before the discovery of neutron and neutron degeneracy pressure, it was believed that “neutron star” should collapse to 
a point (and hence become a singularity). We may currently be in a similar situation with regard to black holes where 
some discovery in the future may prevent the collapse of black hole to a point and becoming a singularity. 
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A rotating black hole has also an ergo-region (or ergo-sphere) around its event horizon where spacetime is 
dragged forcibly by the rotation of the black hole and hence no object within the ergo-region can remain 
stationary (relative to inertial observer at infinity). Another claim about black holes is that falling toward 
a black hole is an endless journey as observed by an external observer (see Exercises). 

We should remark that many of the alleged “facts” about black holes are based on highly abstract 
mathematical models with no physical basis and hence they should not be taken seriously as scientific 
facts although they are commonly presented as such (in fact we are skeptical even about the mere existence 
of black holes despite the recent affirmative claims about the detection of these objects). This is one of 
the main problems in modern physics where the theory leads, rather than follows, the experiment and 
observation and hence in many cases we find the role of experiment and observation is to search for 
and harvest evidence in support of a predetermined theory. Accordingly, many bizarre illusions (such as 
wormholes, time machines, white holes, !?4?] and dark energy) have emerged in recent times thanks to this 
approach (which is largely based on general relativity and its methodology and framework) where they 
are presented as respected scientific theories or even as scientific facts. In fact, the literature of black holes 
(which is almost entirely based on general relativity) is full of these mathematical fantasies and illusions. 
Therefore, the audience of modern physics should be vigilant about these claims to avoid being fooled by 
these theories and “facts”. Anyway, we believe that the existence of black holes (if proved observationally) 
can be explained even by classical gravity with no need for these illusions and fantasies (see the Exercises 
of this section and refer to § 9.7). 


Problems 


1. What we mean by “singularity” when we talk about black holes? 

Answer: Singularity in general relativity means a point in spacetime with an infinite property which 
(in the case of black holes) is the mass density (or curvature of spacetime) because the mass is supposed 
to be accumulated in a single point of space and hence the mass density at that point is infinite. 
However, some may view black holes as confined regions in spacetime with certain properties without 
talking explicitly about infinite density (or infinite curvature) to escape potential criticism or to allow 
possible intervention of other physical theories (namely quantum mechanics) that may modify (and 
hence prevent) the general relativistic nature of black holes as singularities in spacetime. 

2. Analyze the meaning of “radius” in “Schwarzschild radius”. 

Answer: Based on the general relativistic derivation of the expression of Schwarzschild radius (using 
Schwarzschild metric as explained in § 4.1.1 and § 5.1) it should mean coordinate radius rather than 
metrical radius although they may be seen equal according to some opinions (see problem 2 of § 8.3). 
This should have an impact on other aspects related to black holes like “average density” and “volume 
inside event horizon”. However, we generally do not make such restrictions and clarifications to avoid 
unnecessary complications where we rely on this understanding. 

3. Black holes are commonly described as compact objects. Does this apply to all types of black hole? 
Answer: If“compact” means high average density (where average density is defined as the mass divided 
by the volume inside the event horizon) then being compact objects applies only to some types of black 
hole. As we will see in the Exercises, the average density of some black holes (i.e. the very “heavy” 
ones) is very low. So, the characteristic property of black hole is being a concentration of a quantity of 
matter within its Schwarzschild radius which results in preventing light inside its event horizon from 
escaping to outside. Yes, if we accept the claim that black holes are physical singularities, then all black 
holes should have infinite density although their average density may be very low. However, we do not 
believe in the existence of physical singularities because there is no infinite quantities in the real world 
although such quantities may exist in abstract mathematical spaces. Nevertheless, the actual density 
of black holes (even the very “heavy” ones) could be much higher than their average density because 
the mass can be confined to a volume less than the volume inside the event horizon. We should also 
draw the attention to the peculiar meaning of “volume inside the event horizon” due to the presumed 
difference between coordinate variables and metrical variables as indicated in the previous exercise. In 
fact, it is more appropriate in this context to think general relativistically rather than classically. 


[242] White hole may be defined as the time-reverse of black hole. 
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4. 


What is the mass of a black hole whose Schwarzschild radius is 3.5 km? 
Answer: We have Rg = 22 = 3500 m. Hence: 


C2 


CRs (3x 108)” x 3500 


M= Ga ~2x6674x 10-0 


~ 2.360 x 10°° kg 


which is more than the mass of the Sun. 


Exercises 


oRWNEH 


13. 


14. 
15. 
16. 
17. 
18. 
19. 
20. 
21. 


22. 


23. 


24. 
25. 


. Give a succinct and simple definition of black hole. 

. Define event horizon and Schwarzschild radius of black hole. 

. In what sense black holes are “black” and “holes”? 

. How do we detect and observe black holes if they are “black”? 

. Give examples of the mechanisms by which black holes can release energy (regardless of the energy 


release being by the black hole on its own or through its interaction with its surrounding). 


. Make a rough estimate of the energy released by accretion of a certain amount of matter of mass m 


onto a non-rotating black hole of mass M. Comment on the result. 


. List some of the properties of black holes. 

. What we mean by the structure of black hole? 

. What are the main schemes for classifying black holes? 

. Derive a mathematical expression for the Schwarzschild radius using a classical argument. What is the 


significance of this classical derivation? 


. Assess the classical derivation of the previous exercise. 
. What is the Schwarzschild radius of a black hole with a mass (a) equal to the mass of the Earth and 


(b) equal to the mass of the Sun? 

Use some of the previously developed general relativistic formulae related to gravitational time dilation 
and gravitational frequency shift to infer some of the physical properties and consequences of black 
holes. 

Discuss the mathematical singularities of black holes assuming a Schwarzschild metric. Do these 
singularities mean that black holes are physical singularities? 

Can we ascribe Schwarzschild radius to objects other than black holes? 

What characterizes the event horizon of black holes? 

Develop a formula for the average mass density of black holes and analyze it. 

Calculate the average density of a black hole whose mass is (a) equal to the mass of the Earth, (b) 
equal to the mass of the Sun and (c) equal to 10° solar masses. Compare these densities to the density 
of the atomic nucleus and comment. 

What can you conclude from the results of the last two questions with regard to the likelihood of the 
formation of black holes? 

Derive a classical formula for the gravitational field gradient of black hole along the radial direction 
and analyze its classical implications. 

Considering that black holes are fully characterized by three properties: mass, angular momentum and 
electric charge, what are the main types of black hole from this perspective? 

Considering the mass range of black holes, what are the main categories of black hole from this 
perspective? Provide some details about these categories. 

Describe briefly the free fall journey of an astronaut A toward a non-rotating black hole as experienced 
by him and as observed by a stationary observer B located at infinity. Assume that A is initially at 
infinity??4°! and he is falling from rest and hence he follows a radial path toward the black hole. Also, 
assume that A and B have identical light sources and identical and initially-synchronized clocks. 
State the hypothesis (or conjecture) of cosmic censorship and comment. 

What is the implication of the “no hair theorem” (i.e. black holes are characterized by only three 
observable independent physical properties: mass, angular momentum and electric charge) with regard 


[243] Infinity here should mean very far where the gravitational field is very small but not zero. 
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Figure 7: A 2D schematic of the structure of a Schwarzschild black hole (and the surrounding space) 
where the singularity S$, event horizon EH, Schwarzschild radius Rs and photon sphere PS are indicated. 


to the issue of absolute frame? What is the implication with regard to the issue of being physical 
singularities? 


8.7.1 Schwarzschild Black Holes 


These are static black holes characterized by mass M > 0, zero angular momentum J = 0, and zero 
electric charge Q = 0. Accordingly, the spacetime that associates these black holes in the absence of 
other sources of gravity is described by the Schwarzschild metric as their name suggests (see § 4.1). The 
“structure” of the Schwarzschild black hole (and the surrounding space) is very simple as it consists of 
a singularity surrounded by an imaginary sphere (i.e. event horizon) of “radius” Rg centered on the 
singularity (see Figure 7). A photon sphere, which is a sphere like event horizon but with a “radius” of 
1.5Rg (see Figure 7 and Exercises), may also be added to this structure. Schwarzschild black holes should 
be considered the simplest objects in the physical world since they are characterized by mass only and 
hence two Schwarzschild black holes of the same mass are identical. So, in a sense a Schwarzschild black 
hole of a given mass has no identity as it is no more than a chunk of matter in spacetime. 

We should remark that describing this category of black holes as Schwarzschild black holes can be 
misleading since it may suggest that the entire spacetime surrounding these black holes is necessarily 
described by the Schwarzschild metric. In fact, “Schwarzschild black hole” should only mean a black hole 
with mass, zero angular momentum and zero electric charge even if the spacetime contains other sources 
of gravitation and hence the metric of the extended spacetime is more complex than the Schwarzschild 
metric. So, the “Schwarzschild” label originates from the fact that such a black hole is the same as the 
object in the Schwarzschild solution and hence if this black hole is the only object in the spacetime then it 
will generate the Schwarzschild metric. The “Schwarzschild” label should also indicate that the structure 
of the black hole (i.e. the features of the spacetime in the immediate neighborhood of the singularity 
such as event horizon and photon sphere) is described by the Schwarzschild metric although the extended 
spacetime may not be described by this metric. 


Problems 


1. What we mean by saying “two Schwarzschild black holes of the same mass are identical”? Can you 
find an analogy in more familiar and ordinary objects? 
Answer: We mean they are indistinguishable from each other by any intrinsic property although they 
could be distinguished by an extrinsic property such as being at two different points in spacetime. We 
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may make an analogy with two electrons or two protons which are indistinguishable from each other 
by any known intrinsic property such as mass or charge (within our observational limits) although 
they may be distinguished by extrinsic properties like their position or velocity or kinetic energy. 


Exercises 


1. Define Schwarzschild black hole in a few words. 

2. What is the photon sphere of Schwarzschild black hole? 

3. Derive the general relativistic formula for the “radius” of the photon sphere (ic. r = 1.5Rgs) of 
Schwarzschild black hole. 

4. Derive a classical formula for the radius of the “photon sphere of Schwarzschild black hole” and comment 
on its significance. 

5. Investigate the singularities of Schwarzschild black hole. 


8.7.2 Kerr Black Holes 


These are rotating black holes characterized by mass M > 0, angular momentum J ¥ 0, and zero electric 
charge Q = 0. Accordingly, the spacetime that associates these black holes is described by the Kerr 
solution as their name suggests (see § 4.2). Because these black holes are rotating, they have more 
complex structure than the simple structure of the static Schwarzschild black holes (refer to Figure 8). 
According to the Kerr solution of the Field Equations, rotating black holes are surrounded by a zone 
called the ergo-region (or ergo-sphere)|?441 where no object within this zone can stay at rest relative to 
an inertial frame at infinity because of frame dragging and hence any object in this region is forced to 
rotate in the sense of rotation of the black hole. However, objects within the ergo-region (and outside 
the event horizon) can still escape the gravitational grip of the black hole and hence matter and energy 
can be extracted from this region. The ergo-region is defined externally by an ellipsoidal surface (which 
has axial symmetry around the axis of rotation) called the stationary limit (or static limit) which is a 
surface of infinite red shift. The stationary limit may be described by some as the locus in spacetime 
where spacetime flows at the speed of light. 


Exercises 


1. What is the implication of having rotating (or Kerr) black holes? 

2. Why should Kerr black holes be the most common type of stellar black holes in the Universe (assuming 
they do exist)? 

3. What is the ergo-region of a rotating black hole? 

4. Mention some of the characteristic features of the ergo-region of a rotating black hole. 

5. Give some of the characteristic features of the stationary limit of a rotating black hole. 

6. Compare the event horizon and the surface of infinite red shift in the Schwarzschild and Kerr black 
holes. 

7. Classify the regions surrounding Kerr black holes according to the ability of a physical object (as seen 
by an inertial observer at infinity) to stand still and rotate in one sense or another. 

8. Try to use the terms “stationary” and “static” (as used commonly in the literature of general relativity) 
to describe Schwarzschild and Kerr black holes. 


8.7.3. Hawking Radiation 


This is a hypothetical effect predicted by Stephen Hawking as a result of applying quantum mechanics to 
general relativity and hence it is a quantum gravitational effect. In fact, the theory of Hawking radiation 
may be regarded as the first quantum gravitation theory. Hawking radiation may also be called Hawking 
effect or black hole evaporation. According to this theoretical prediction, black holes can emit black body 
radiation as a result of quantum mechanical processes in the vicinity of the event horizon and hence they 
“evaporate”. It should be obvious that because of the equivalence between mass and energy, emission 
of radiation means losing mass and hence “evaporation”. Accordingly, if these quantum losses are not 


[244] Tn our view, “ergo-region” is more appropriate than “ergo-sphere” (unless sphere means something like zone or field). 
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Figure 8: A 2D schematic of the structure of a Kerr black hole where the singularity 5, event horizon 
EH, ergo-region ER, stationary limit SL and the axis of rotation AR are indicated. We note that the 
singularity at the center is not a point but a ring of coordinate radius a in the equatorial plane (see § 4.2). 
We also note that some delicate details of the structure of Kerr black hole (e.g. inner event horizon) are 
not demonstrated in this simplistic illustration. 


compensated by gaining matter and energy from the surrounding of the black hole then the black hole 
can in theory evaporate totally and hence disappear. 

Hawking effect may seem contradictory to the premise that nothing (including radiation) can escape from 
the gravitational grip of black holes. However, this is addressed in the theory by a strange hypothetical 
scenario in which virtual particle-antiparticle pairs participate and hence there is no violation to this 
premise.!?45] In fact, this is one of the masterpieces of wild imagination in modern physics and hence 
we do not go through these useless details (although we will touch on the issue mildly in the questions). 
Anyway, it should be understood that the above premise is about classical emission processes related 
to classical black holes while Hawking radiation is a quantum mechanical process related to quantum 
mechanical black holes.!?461 In fact, Hawking radiation (according to the alleged mechanism of emission) 
is not a radiation originating from inside the black hole to violate the aforementioned premise. 

Hawking radiation is expressed quantitatively by the following relation which links the “temperature” 
of the black hole (which may be called the Hawking temperature) as a black body radiator to its mass: 


hc3 
a 8rkGM 


where T and M are the temperature and mass of the black hole, fi is the reduced Planck constant (= h/27), 
c is the speed of light, & is the Boltzmann constant and G is the gravitational constant. As we see, the 
temperature of black hole as a thermal radiator solely depends on its mass in a reciprocal relation and 
hence “light” black holes are hotter than “heavy” black holes. 

Apart from being an entirely theoretical effect with no observational foundation or verification, Hawking 
radiation has been criticized and challenged in its theoretical foundation, e.g. by the so called “information 
loss paradox” (which is a fantasy built on a fantasy). Anyway, if we accept Hawking radiation (and hence 


(304) 


[245] Virtual particles and antiparticles are paradigms representing physical entities within the theoretical framework of 
relativistic quantum mechanics. 

“Classical” in such contexts is opposite to quantum mechanical and hence even the relativity theories and their impli- 
cations are classical in this sense. Also, “quantum mechanical black holes” means black holes that follow the rules and 
laws of quantum physics. 


[246] 
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accept the above formula) then we may need to consider renaming these objects (e.g. by calling them 
“dark holes”) since they are not really “black”. We may also need to adjust the “no hair theorem” by 
adding the temperature as a fourth property of black holes (even if we interpret the temperature in a 
non-conventional sense) although this may be challenged by the dependency of temperature on mass. 
However, the quantitative dependency of T on M according to Eq. 304 does not affect the fact that 
mass and temperature are two independent properties even though they are linked through a certain 
quantitative relation. In fact, we have a similar dependency between mass and angular momentum and 
we still consider these as two independent properties according to the “no hair theorem”. However, we 
should admit that angular momentum depends on other variables and hence overall it is independent 
of mass; therefore the challenge could be valid. Also, the “no hair theorem” seems to be formulated 
specifically to classical black holes and hence the restriction to mass, angular momentum and electric 
charge is justified since temperature is quantum mechanical. 


Problems 


1. Discuss the issue of the credit for developing the theory of black hole radiation. 
Answer: Although the name of Hawking is commonly attached to the theory of black hole radiation 
and black hole entropy and temperature, the roots of these ideas seem to originate from the work of 
Bekenstein. However, the detailed quantum mechanical application and development of formalism of 
these ideas seem to belong mainly to Hawking (although Hawking was initially against the idea of 
black hole radiation). Anyway, we think the credit for the theory of black hole radiation should be 
shared to include Bekenstein (and possibly others). 

2. What is the temperature of a black hole with (a) 10 Earth masses and (b) 0.1 Earth mass? 
Answer: 
(a) We have M ~ 5.972 x 107° kg and hence: 


hee 6.62607 x 10-*4 x (3 x 108)” 


T= ee ~ 0.00206 K 
8tkGM 167? x 1.38065 x 10-73 x 6.674 x 10-!! x 5.972 x 1075 


(b) We have M = 0.01 of the mass in part (a) and hence the temperature should be scaled by a factor 
of 1/0.01 = 100. Therefore, T ~ 0.206 K. 


Exercises 


1. What is the essence of Hawking radiation and its basic mechanism? 

2. Analyze Hawking radiation formula for the temperature of black hole. 

3. Derive a formula for the power radiated by a black hole assuming it is a black body and analyze it. 
4. It is claimed that classical black holes violate the third law of thermodynamics. Try to justify. 


8.8 Geodetic Effect 


Geodetic effect (which is also known as geodetic precession or de Sitter effect or geodesic gyroscope 
precession) is another characteristic prediction of general relativity that has no parallel in classical gravity. 
The effect is simply based on the fact that the spacetime of general relativity is curved Riemannian, unlike 
the space of classical gravity which is flat Euclidean, and hence when vectors are parallel-transported in 
the spacetime they will be affected by the curvature of spacetime. To be more clear, let have a gyroscope 
(see Exercises) in a flat Euclidean space that moves freely around a closed path in the space, and assume 
that the gyroscope starts its journey from an initial position P, where its spin (or angular momentum) 
vector points into a fixed direction (say the direction of a given vector V,). Due to the conservation of 
angular momentum in the absence of external torques the gyroscope spin vector will keep pointing into 
the same initial direction V, during its entire journey and hence when it returns to its initial position 
P, the final direction of its spin vector will be the same as its initial direction V,. In other words, 
the angular momentum vector of the gyroscope is parallel-transported along its closed trajectory in an 
absolute Euclidean sense of parallelism. 

Now, let the gyroscope be moving freely in a curved Riemannian space (i.e. the spacetime of general 
relativity) where it performs a similar journey around a similar closed path. Since the space is curved 
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CP CP 


Figure 9: A gyroscope G moving freely around a closed path CP where on the left the gyroscope is in a 
flat spacetime and hence it returns to its starting point without change of its spin vector V, while on the 
right the gyroscope is in a curved spacetime due to the presence of a gravitating body GB and hence it 
returns to its starting point with change of its spin vector V; to V2 (where a is the precession angle). 


then by parallel-transporting|?47) the angular momentum vector along the closed trajectory the direction 
of the gyroscope spin will not be preserved in a Euclidean sense. In other words, the final direction of the 
gyroscope spin vector is not the same as its initial direction (refer to § 2.8 and see Figure 9). Accordingly, 
if we have for example a gyroscope on board a satellite orbiting the Earth then after each revolution the 
gyroscope spin vector will deviate slightly from its previous initial direction. In fact, the gravitational 
field of the Earth is very weak and hence the curvature of the spacetime is very small and therefore the 
deviation in each revolution is very tiny. However, after many revolutions the effect of these individual 
deviations should accumulate and become measurable (if this effect does exist). 

It is shown in the literature that the angle of precession per revolution (assuming Schwarzschild geom- 


etry) is given by: 
M\ 2 
i 5 b-(-% ) | (305) 


Cr 


where a is the angle of precession in radians, i.e. the angle through which the spin vector of the gyroscope 
is rotated in the direction of the orbital motion in the plane of the orbit in one revolution around a 
gravitating body of mass M following a circular orbit of radius (or radial coordinate) r. 


Exercises 
1. Give a brief definition of gyroscope. 


2. Find the precession angle experienced in one revolution by the spin vector of a satellite gyroscope 
skimming the Earth in a circular orbit at very low altitude. What is the precession angle in one year? 


8.9 Frame Dragging 


This effect (which may also be called inertial frame dragging or dragging of inertial frames) is a purely 
general relativistic effect where a rotating massive body!?48 is predicted to distort the metric of the 
spacetime in its surrounding by its rotation resulting in modifying the geodesic trajectories (or world 
lines) by rotation. So, unlike classical gravity where gravitation depends on the mass of the gravitating 


[247] Ty fact, parallel-transporting is the requirement of the conservation of angular momentum in the absence of external 
torques. 
[248] In fact, rotation is an instance of motion of the gravitating object which drags inertial frames in its vicinity. 
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object but not on its rotation, in general relativity the rotation of the gravitating object has direct impact 
on the geometry of spacetime which leads to gravitational effects. For example, a massive body falling 
from rest along a radial direction toward a massive rotating object (e.g. Kerr black hole) is expected to 
be dragged by the rotation and hence its motion will gain a non-radial (or azimuthal) component. 

In more technical terms, a freely falling particle in the field of a rotating gravitating object at the center 
of a standard spherical coordinate system?! (with the @ = 0 axis representing the axis of rotation and ¢ 
representing the azimuthal coordinate that quantifies the rotation) is expected to experience a variation 
of its azimuthal coordinate ¢ by a variation in its radial coordinate r (i.e. ae # 0) and hence the particle 
is dragged by an angle Ad ¥ 0 as it falls radially toward the gravitating object. This is obviously in 
contrast to the classical gravity where no such variation is predicted and hence ae = 0 and Ad =0. 

The prominent example of frame dragging is the so-called Lense-Thirring effect (or Lense-Thirring 
precession) where gyroscopic precession is predicted to occur due to frame dragging caused by a rotating 
gravitating object like the Earth (see § 9.9.2). This may be demonstrated by a gyroscope in a satellite 
that orbits the Earth in a polar plane (i.e. a plane that embeds the axis of rotation of the Earth) where 
frame dragging will force the orbital plane to rotate around the Earth causing the gyroscope to precess 
(see Exercises). 

Exercises 


1. Explain frame dragging in a few simple words. 

2. Why is frame dragging called inertial frame dragging? 

3. Explain, with a simple sketch, gyroscopic precession caused by frame dragging in a satellite orbiting 
the Earth in a polar plane. 

4. What feature of the Kerr metric relates to frame dragging? 


8.10 Wormholes and Other Fantasies 


Wormbholes in spacetime are supposed to be one of the consequences and predictions of general relativity. 
In fact, wormholes is just an example of many bizarre ideas in modern physics whose roots originate from 
general relativity. We believe that wormholes and their alike are fantasies and utter illusion (in fact they 
are typical examples of how modern physics can become absurd). This sort of ridiculous science should 
point out to potential defects in the theory of general relativity itself since it leads to such nonsensical 
consequences and predictions. We believe no one should waste his time in these futile investigations and 
hence the interested reader in wormholes and their alike should refer to the literature of general relativity. 


8.11 Cosmological Predictions 


There are many cosmological consequences and predictions that are based on general relativity (as part 
of the so-called relativistic cosmology). In fact, modern cosmology is almost entirely based on general 
relativity. However, almost all these consequences and predictions are theoretical in nature and utterly 
hypothetical with no observational basis since they represent speculations and contemplations about the 
Universe and its origin and destiny and hence they are a strange mix of philosophy and mathematics with 
some science (mainly physics and chemistry). Moreover, large parts of these consequences and predictions 
are based on the alleged Big Bang and hence the result is not expected to be more credible than its 
genesis. In fact, modern cosmology cannot be regarded as a scientific discipline in a strict sense due to 
the involvement of many non-scientific elements and the contamination with metaphysical ideas and wild 
speculations which have no observational ground. Furthermore, most of its predictions and results are 
scientifically unverifiable and hence they cannot be classified as science (see § 1.9). Therefore, we should 
not waste our time in these useless investigations and speculations. The interested reader should consult 
the literature of general relativity and relativistic cosmology about these nonsensical details. 


[249] We are considering only the angular part of the system (ie. the 0 and ¢ coordinates). 


Chapter 9 
Tests of General Relativity 


The subject of this chapter is the experimental verification and observational evidence in support of 
the theoretical consequences and predictions of general relativity that were investigated in the previous 
chapters (and the last chapter in particular). We should note that the tests of general relativity are 
commonly classified into several categories. The tests are initially classified as classical and non-classical 
where the classical refer to those tests that were proposed by Einstein (i.e. perihelion precession of 
Mercury, light bending by the Sun and gravitational red shift of light). Some may also add to the list of 
classical tests the tests that are similar in their working principle to these three tests such as gravitational 
time dilation and Shapiro time delay tests. The list of classical tests may also be extended by some to 
include tests of special relativity since according to the equivalence principle of general relativity special 
relativity holds locally in freely falling non-rotating frames and hence any valid test to the theory of special 
relativity should be a valid test to general relativity. 

The non-classical tests are also classified into several categories where different authors generally adopt 
different categorization plans. For example, some authors classify non-classical tests into modern tests 
(such as frame dragging tests), strong field tests (such as black holes observations) and cosmological tests 
(such as gravitational lensing) while other authors classify non-classical tests into satellite based tests 
(such as geodetic precession tests), astronomical tests (such as black holes observations) and gravitational 
waves tests (such as LIGO observations). 

In fact, many of these categories are not mutually exclusive (or disjoint) and hence we may find a 
single test listed (or can be listed) into two categories or more. Moreover, some of these categorization 
plans do not seem comprehensive. Anyway, most of these categorization plans are based on somewhat 
arbitrary criteria (e.g. practical or historical criteria or personal preference and convenience) rather than 
fundamental criteria. In fact, some of these categorization plans are so arbitrary and uninformed that 
they can be described as nonsensical and illogical. Accordingly, we do not see real benefit or value in 
adopting any one of these categorization plans or creating a new categorization plan. Therefore, in the 
following sections we include the main tests (as well as alleged tests) of general relativity in a single list (or 
category) considering in the sectioning and ordering a broad range of reasons such as clarity, chronology 
and convenience. In fact, we are generally guided by the structure of the previous chapter although that 
structure is partly based on the intended structure of the present chapter. 


Problems 


1. Discuss the factors that should be considered in determining the value and significance of any evidence 
in support of a theory about a particular physical phenomenon such as gravity. 
Answer: We think the following factors (among other factors) should be considered when assessing 
the value and significance of any evidence in support of a scientific theory: 
(a) Whether there is or there is not another theory that can explain the phenomenon. Accordingly, the 
value of the evidence should diminish if there is another theory because the evidence is not exclusive 
to the theory and conclusive about its validity since it can be explained by the other theory. In other 
words, even if the theory is wrong the evidence can still be explained by the other theory and hence 
the evidence does not necessarily imply the correctness of the theory. 
(b) Whether the evidence does exist before or after the creation of the theory. Accordingly, if the 
evidence does exist before then its value should diminish because the theory has been (or at least 
could have been) created and tuned to match the evidence. 
(c) Whether the evidence is correlated to other evidence. For example, time dilation and frequency 
shift are correlated effects (see § 6.3.4) and hence the value of the evidence on each one of these effects 
should diminish since they both relate to the same aspect of the theory and hence in this sense they 
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are not independent of each other. In other words, they should be regarded as a single type of evidence 
and not as two different types of evidence. 

(d) Whether there is bias for or against the theory or not. Accordingly, the value of the evidence 
should diminish (or increase) if there is bias in favor of (or against) the theory because the likelihood 
of prejudice and error should increase (or decrease) in this case. In fact, general relativity is a typical 
example of a theory with bias in its favor. 

(e) Whether the evidence is based on trust or not. For example, a theoretical evidence that any one 
(with appropriate qualification) can check does not depend on trust while the result of an exotic ex- 
periment or observation depends on trust because no one (except the experimenter or observer) has 
access to the experiment or observation to verify its authenticity and validity. 

(f) Whether the evidence requires the full formulation of the theory or not. Accordingly, an evidence 
that can be explained by an independent rule or principle within the framework of the theory without 
need to the full formulation is less significant in its endorsing value because the full formulation of the 
theory is not needed to justify the evidence and hence even if the full formulation of the theory was 
wrong we can still explain and justify the evidence. 

(g) Whether the physical setting from which the evidence is extracted is sufficiently simple or too 
complicated and hence how many factors are involved. This is because when the system is sufficiently 
simple then the sources of error and uncertainty are less and the result will usually lend itself to a 
rather simple, clear and conclusive analysis while the situation will be different in the opposite case. 
Accordingly, the value of scientific evidence should decrease as the complexity of the physical system 
and the experimental and analytical tools increases due to the increased probability of uncertainty and 
error in the results as well as in the analysis of the results. In fact, the physical systems from which the 
general relativistic evidence are extracted are generally very complex and they require very complex 
experimental and theoretical tools and hence their value should diminish from this perspective. 

(h) Whether the results belong to a single theory or they represent combined effect of two (or more) 
theories. Accordingly, the value of any evidence should diminish when the results represent combined 
effect of more than one theory because we do not know what is the actual contribution of each theory 
to the observed combined effect and hence even if the observed combined effect agrees with the theo- 
retically predicted combined effect the results should still be ambiguous about the endorsement of the 
theories individually and separately. For example, if the theoretical prediction attributes 50% of the 
combined effect to theory 1 and 50% to theory 2, it is still possible that the actual contribution of the 
effect of theory 1 is 25% while the actual contribution of the effect of theory 2 is 75% and hence the 
evidence is still inconclusive about endorsing the theories. In fact, when the results represent combined 
effect of more than one theory we may need the applicability of the superposition principle (i.e. the 
effects of the individual theories are additive) and this may not be evident or well established. 

(i) Whether the evidence is obtained from a physical system whose state is described and modeled 
by tentative theoretical models and subject to hypothetical conditions and assumptions. Accordingly, 
the value of any evidence obtained from such a system should not exceed the value of its origin and 
hence its value should diminish substantially. This is particularly true with regard to the evidence 
obtained from most astronomical and cosmological systems where these systems are subject to highly 
hypothetical models and tentative presumptions. So, the value of astronomical and cosmological evi- 
dence is generally less due to many uncertainties and sources of error (see exercise 6 of § 9.2). In fact, 
considerable amount of the claimed evidence in support of general relativity is obtained from such 
systems. 

(j) Whether there is a conflict in the results or/and in the opinions about the interpretation and 
significance of the results. For example, if a specific experiment or observation was conducted more 
than once (say by two scientific teams) and the two disagree about the results then the significance 
of the evidence (even if both results endorse the theory in some aspects) should diminish because the 
possibility of error (which should have already occurred somewhere) should increase due to the conflict. 
Similarly, if the result of a specific experiment or observation is (or can be) interpreted and evaluated 
differently by different opinions then the value of the evidence should also diminish due to the lack of 
clarity about the implication and significance of the result. 
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It is noteworthy that there are many other factors that contribute to the determination of the value 
of scientific evidence (some of which may be specific to each individual evidence) and hence the above 
list represents a sample of general factors that could serve as guiding principles in assessing the value 
of evidence in science. 


Exercises 


1. What is the significance of endorsing a theory by evidence that belongs to a particular aspect of the 
theory? What about having evidence against a particular aspect of the theory? 

2. Discuss the relation between the tests of general relativity and the tests of special relativity. 

What correct /incorrect theory means? 

4. Can we have correct predictions from wrong theories, and is this consistent with the rules of logic? 


. 


9.1 Perihelion Precession of Mercury 


Historically, this may be seen as the first “evidence” in support of general relativity because it emerged 
during the formulation of the theory. In fact, the formulation of the theory was tuned to match the 
observation and hence it should not be considered as “evidence” in the technical sense. Yes, the correctness 
of the theoretical “prediction” of general relativity to this effect means that the theory is compliant with 
the precession observations and hence it is correct in this regard. As discussed in § 8.1, the perihelion 
precession of Mercury is just an instance of the periastron precession that is predicted by general relativity 
to occur in orbital motion and hence this evidence is more general than Mercury although historically the 
precession of Mercury was the focus of this prediction. Accordingly, this evidence extends to include many 
instances where there is no “tuning to match observation” and hence it has predictive value. As we will 
see, there are claims of strong experimental evidence in support of the predicted relativistic precession 
in the orbital motion of binary star systems. Also, the precession of other planets in the solar system 
generally agrees with the predictions of general relativity although the agreement may not be as good as 
in Mercury. 

In our view, the precession of Mercury is seemingly the most convincing evidence in support of general 
relativity as a gravity theory. However, we may note the following about the significance of this evidence: 
e As indicated above, the theory was tuned to match this observation and hence the significance of this 
prediction should be diminished. It should be obvious that the significance of this evidence will be more 
important if this agreement came without knowing what we should expect. However, the subsequent 
agreement with other precession observations (i.e. of binary systems and of other planets of the solar 
system) should strengthen the significance of this evidence (although the agreement in most cases is not 
as good as in the case of Mercury). 

e It may also be claimed that the derivation of the perihelion precession formula from a classically-based 
method (i.e. the retarded gravitational potential theory of Gerber) should diminish the value of the 
endorsement of this evidence to general relativity. However, it is claimed that the retarded potential 
theory is an ad hoc fix rather than a complete and consistent theory. Unfortunately, we do not have first 
hand information about this theory to assess its validity and significance and if the theory is sufficiently 
general to predict other precession observations (i.e. other than Mercury). Anyway, being an ad hoc fix 
(even if this claim is correct) should not diminish its value as a potential competitor to general relativity 
in explaining the precession observation(s). It should also be noted that the precession formula was also 
obtained later from other non-classical gravitational theories (e.g. those of Whitehead and Birkhoff). 

e We should also note that there are more recent Newtonian formulations of the precession phenomenon 
in orbital motion (although they generally do not agree with the general relativistic prediction quantita- 
tively). Accordingly, the value of any evidence in this regard in endorsing general relativity specifically 
should be diminished due to the existence of these classically-based formulations which compete (at least 
qualitatively) with general relativity in explaining this phenomenon. 

e There may also be a possibility of “tuning” the observations to make a “perfect match” with the prediction 
of general relativity. We may use the following quote from Clemence (which is taken from Synge) to justify 
our concern: “The observations cannot be made in a Newtonian frame of reference. They are referred to 
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the moving equinox, that is, they are affected by the precession of the equinoxes, and the determination 
of the precessional motion is one of the most difficult problems of positional astronomy, if not the most 
difficult. In the light of all these hazards it is not surprising that a difference of opinion could exist 
regarding the closeness of agreement between the observed and theoretical motions”. This concern may at 
least be a basis for suspecting the existence of an exaggeration in the claimed accuracy of the agreement. 
In fact, there are other sources of concern about the claimed accuracy of the agreement such as the lack of 
relativistic treatment of the many-body problem and the lack of well determined reference for the rotation 
of perihelion (see for example Synge in the References). 

e The precession of periastron is also observed in certain binary pulsar systems and it is claimed that the 
evidence is conclusive in its agreement with the general relativistic prediction.°°! This should undisput- 
edly give more weight to this evidence in its support to general relativity. However, this should depend 
on the level of certainty in the parameters (like the mass of the orbiting objects) of the binary system 
and if these parameters are obtained independently of general relativity or not. We should also add the 
precession observations of other planets in the solar system which seem to endorse the general relativistic 
predictions although the agreement (in some cases at least) may not be as impressive as in the case of 
Mercury. 

e As discussed briefly in § 8.1, the “excellent” agreement between the general relativistic prediction and 
observation is the result of questionable derivation and calculation which are partly based on using classical 
results whose validity is restricted to flat spacetime and hence they partly ignore the effect of spacetime 
curvature.!?51]_ Although the effect of spacetime curvature is negligible in this classical case (as long as 
we respect the locality condition), the violation of the locality condition makes this effect considerable 
by accumulation. In fact, the large spatial extension of the orbit associated with the large time period 
of a century will make ignoring the effect of even a very tiny curvature unacceptable and will reduce the 
result useless (or even totally wrong). The reality is that the violation of the locality condition itself is 
what made this precession observable and considerable in this classical case, so we cannot consider the 
impact of the violation of locality in the effect itself but ignore it in the consideration of the curvature 
of spacetime. This is inline with our previous observation (see exercise 8 of § 8.1) that even with the 
consideration of the orbiting system as a classical system we still observe the extremely tiny precession of 
the orbit over extended space (of a planetary orbit) and long time period (of a century), so even the tiny 
effect of spacetime curvature should not be neglected over this extended space and long time period. In 
fact, qualitative and quantitative analysis should lead to the conclusion that the effect of the curvature 
of spacetime should be comparable to the predicted 43 arcseconds that we obtained from the implicit 
assumption of flat spacetime. This should put a big question mark on the validity of the perihelion 
precession in endorsing general relativity. This applies at least to the problem of the perihelion precession 
of Mercury although this should extend even to other precession problems, e.g. periastron precession 
of binary systems whose results seem to be obtained from a similar semi-classical method of derivation 
and calculation. In fact, some of these systems are not classical and hence the effect of the spacetime 
curvature and the impact of the violation of locality should be more serious and pronounced. To sum up, 
fully-relativistic analytical treatments and calculations for the perihelion/periastron precession problem 
are required for this evidence to be valid and conclusive in endorsing general relativity (or otherwise). 
The current treatments and calculations contain classical and flat-space elements and hence their validity 
and conclusivity are questionable. 

e Due to the involvement of several causes of precession (see § 8.1), we may need to assume the validity 
of the superposition principle (although this seems logical in this case). 

Exercises 


1. Is the general relativistic prediction about extra precession of orbits verified in the case of planets other 


[250] We note that in our derivation of the orbital precession formula we used some classical arguments and hence the derived 
formula may not be valid in non-classical gravitating systems. However, these arguments can potentially be generalized; 
moreover there are other methods for deriving the orbital precession formula in general (although they also contain 
questionable aspects). 

[251] This is demonstrated for example by employing flat-space formulae in the derivation and using physical variables, in 
place of coordinate variables, in the calculations (refer to problem 1 of § 8.1). 
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than Mercury? What about other orbiting systems? 


9.2 Light Bending by Gravity 


This may be seen as one of the most established evidence in support of general relativity. However, there 
are several factors that diminish its value as supporting evidence to general relativity: 

e Most observations (which are in the optical part of the spectrum) have very large uncertainty and 
hence they do not lend full support to the general relativistic prediction. Yes, there are recent claims 
of observations with high certainty that agree very well with the predictions of general relativity. These 
observations are related to the deflection of radio waves using baseline interferometric techniques. However, 
there are some sources of uncertainty about the source of emission as well as the detection technique and 
hence we wonder if the analysis of the data is sufficiently objective towards factors like these. 

e Light bending by gravity can also be predicted by the classical theory of gravity (see § 8.2) and hence it 
is not a decisive evidence for general relativity. So, qualitatively classical physics and general relativity are 
equal in their relation to the prediction of light bending and hence both theories can claim to be endorsed 
by this evidence.|252]_ However, from a quantitative perspective the predictions of the two theories are 
different since the classical prediction is half the general relativistic prediction. Nevertheless, considering 
the large uncertainty in the observations (which we indicated in the previous point), the evidence can 
still be shared by classical physics since the quantitative prediction of general relativity is not confirmed 
and hence the amount of bending may well be explained by classical gravity (at least in some cases). 
Moreover, there are claims in the research literature of classical derivations of the light deflection formula 
that agrees with the general relativistic formula and this could cast more doubts over the validity of the 
light bending by gravity as an endorsement to general relativity specifically. We should also note that 
light bending by gravity can also be predicted by non-classical metric gravity theories other than general 
relativity. 

e At least some of the proclaimed evidence for light bending by gravity (e.g. gravitational lensing) may 
be explained by non-gravitational causes. For example, some cases of alleged gravitational lensing may 
arise from purely optical causes such as refraction of light in the atmosphere of the lensing object or 
diffraction of light by the lensing object. This may be particularly true in the case of “Einstein ring” 
where the pattern is very similar to diffraction. In fact, some of the proclaimed evidence may even be 
explained (tentatively at least) by fortunate occurrences and accidents such as the accidental existence 
of very similar observed objects around the presumed lensing object. The reality is that many of these 
observational evidence rely on the validity of general relativity and hence they are not independent of 
the theory itself; in other words they are effectively circular. In brief, there is a huge bias in analyzing 
the alleged observational evidence since the observers (most of whom are staunch supporters of general 
relativity) assume the validity of this theory (implicitly at least) and hence when they analyze the evidence 
they already assume a gravitational cause and assume that the proper theory for conducting this analysis 
is general relativity. So, the purpose of any presumed analysis is to verify the quantitative agreement 
between the observation and the theory and in many cases this verification can be easily done by tuning the 
parameters (consciously or unconsciously) to achieve the match where the numerous sources of uncertainty 
and error in such observational evidence grant many degrees of freedom in manipulating the evidence 
(legitimately or illegitimately). In other words, when these evidencel?>*! are examined and analyzed 
individually and independently of each other and of the theory of general relativity most of them will not 
lend definite support to general relativity or favor this theory, but when they are investigated within a 
single context and under a single banner (i.e. light bending by gravity as predicted by general relativity) 
then the evidence will look convincing and even overwhelming. 


[252] As we will see, the prediction of light bending is not specific to general relativity or one of the achievements of Einstein 
as it is claimed in the literature. In fact, the deflection of light by gravitational field was predicted and formulated even 
in classical physics where work and results related to this issue are attributed to Newton, Cavendish and Soldner as 
well as other classical physicists (see Problems and refer to the literature about light deflection by gravity). 

[253] We use “these evidence” and its alike for clarity (which is of primary importance in scientific texts) even if it is not 
allowed linguistically. 
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e Considerable part of the claimed evidence in support of light bending by gravity has only qualitative 
value at most. For example, most or all observations of gravitational lensing do not provide quantitative 
evidence independent of the theory used in the analysis because in gravitational lensing most parameters 
(such as the mass of the lensing object) are guessed rather than measured or observed. In fact, this 
guessing extends in some cases even to the existence of the presumed lensing object because there is no 
independent evidence for the existence of the lensing object (such as black hole whose existence cannot 
be observed and verified directly) apart from the presumed gravitational lensing. Accordingly, even if we 
assume that the claimed evidence supports light bending by gravity qualitatively, it does not provide a 
conclusive quantitative evidence in support of a particular theory unless we assume a priori the validity 
of a particular theory, and this is obviously circular. In brief, gravitational lensing (and its alike) is 
at the best a qualitative evidence and hence it can be easily explained by other theories (classical and 
non-classical) and cannot be a conclusive evidence in support of general relativity specifically. 

e The general relativistic derivation of the formula of light bending by gravity does not necessarily require 
the full formulation of general relativity (as represented by the Field Equations) [254] and hence it cannot 
be seen as evidence for the full formulation of general relativity. In other words, the general relativistic 
formula can be obtained from simple arguments based on the weak equivalence principle and hence it does 
not require the full force of general relativity. Accordingly, even if we assume that this test is compliant 
with the prediction of general relativity it cannot be regarded as definite and specific evidence to the full 
formulation of general relativity although it can be regarded as support to some of the arguments and 
principles that general relativity rests upon. It should be remarked that the predictions of the derivation 
from the equivalence principle (according to some widely-accepted derivation methods) and from the 
full formulation differ quantitatively and hence from a quantitative perspective the agreement with the 
prediction of the derivation from the full formulation (or otherwise) should be a decisive factor. However, 
this does not apply to all the derivation methods from the equivalence principle. Moreover, many of the 
claimed evidence have only qualitative value and hence they should be indifferent with respect to the 
quantitative aspect. 

e The derivation of the relativistic light bending formula (i.e. Eq. 293) is based on certain approximations 
(see the Problems of § 8.2) and hence its general applicability is questionable. This means that even the 
highly reliable observations (e.g. the observations related to the deflection of radio waves) that supposedly 
endorse general relativity with high confidence and certainty can in fact endorse general relativity only 
within the limits of validity of these approximations. Therefore, these observations cannot endorse the 
predictions of general relativity in general (i.e. when the above approximations do not apply). 

e We should also mention that there are numerous sources of uncertainty and error in the light bending 
observations in general (especially those in the optical range) due for example to optical distortion by 
the instruments or to refraction by the Earth atmosphere. These factors should affect the certainty and 
reliability of this test (at least in some cases). 

To conclude, light bending by gravity which is seen as one of the best established evidence in support of 
general relativity is questionable in many aspects and details and hence it cannot be classified as conclusive 
evidence in support of general relativity despite the common claims in the literature about its validity 
and significance. 


Problems 


1. Provide a short historical account on the investigation by classical physicists (prior to general relativity) 
of light bending by gravity. 
Answer: In the literature of general relativity the deflection of light by gravity is commonly presented 
as one of the unprecedented achievements of this theory. However, this issue has been investigated by 
several classical physicists (as early as Newton). The available historical records show that Newton has 
contemplated about the deflection of light by gravity, which is inline with his corpuscular theory about 
the nature of light. The records similarly show that this issue has also been examined qualitatively 
by Jean-Paul Marat in the 1780s. The first quantitative calculation (or estimation) of light bending 


[254] We note that the geodesic equations are based on the space metric and hence they are ultimately a product of the Field 
Equations which produce the metric. 
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by gravity seems to belong to Cavendish in the second half of the 18°” century. The first confirmed 
derivation of the classical formula of light bending by gravity is attributed to Johann Soldner who 
published a paper in the early years of 1800s in which he did not only derive the classical formula but 
he also estimated correctly (according to the classical prediction and within the uncertainties in the 
physical constants at that time) the amount of deflection that should be caused by the Sun. Moreover, 
he suggested corrections to the observations of the positions of the fixed stars (if observed too close to 
the Sun) based on the errors in the apparent position due to the effect of bending. 

In brief, the issue of light deflection by gravitational fields has been investigated qualitatively and 
quantitatively rather thoroughly by classical physicists and hence it is not an invention or discovery 
or precedence of general relativity. Yes, the general relativistic prediction is quantitatively different 
from the classical prediction and hence if general relativity should be accredited it should be for this 
and only for this. However, in the light of the uncertainties about the evidence in support of the 
general relativistic prediction even this credit should be provisional and require more endorsement 
from observations. Moreover, there are non-relativistic predictions that agree quantitatively with 
the relativistic prediction and hence any credit will not belong to general relativity specifically and 
exclusively. 


Exercises 


1. Investigate bending of star light by the Sun and assess its value as one of the important evidence in 
support of general relativity. 

2. Discuss the historical development of the derivation of light bending formula within the framework of 
general relativity and the significance of this on the value of this evidence. 

3. Examine the possible reasons for the premature endorsement of general relativity by the results of the 
solar eclipse of 1919. 

4. Briefly discuss and assess gravitational lensing as one of the claimed evidence in support of general 
relativity. 

5. Talk about the tests of light bending by gravity in the radio band (rather than the optical band) of 
the electromagnetic spectrum. 

6. Investigate the sources of error and uncertainty in the results of astronomical and cosmological inves- 
tigations in general. 


9.3. Gravitational Time Dilation 


Gravitational time dilation may be regarded as one of the most compelling evidence in support of general 
relativity. In the following subsections we briefly investigate a number of experiments and applications 
that are based on the gravitational time dilation and hence they can be considered as direct evidence in 
support of this aspect of general relativity. We should remark that because of the correlation between 
gravitational time dilation and gravitational frequency shift (see § 6.3.4) most (if not all) of the claimed 
evidence of gravitational time dilation can be explained indirectly by classical gravity (and hence they are 
not conclusive evidence in support of general relativity) although time dilation does not exist classically. 
In other words, although theoretically and conceptually gravitational time dilation may not be possible to 
explain classically, it is possible to explain indirectly as gravitational frequency shift due to the correlation 
between time dilation and frequency shift at least in some cases. This is because in § 8.4 we derived a 
classical formula for gravitational frequency shift using the conservation of energy and it was shown 
there that the classical formula and the general relativistic formula produce virtually identical results in 
most cases since their power series approximations are identical under certain practical conditions and 
assumptions. So, at least some of the instances of gravitational time dilation evidence (which meet the 
conditions of the power series approximation and involve frequency as timing mechanism)|?>] can be 


[255] Referring to our investigation in § 8.4, we may need to add another condition that is for the gravitational time dilation 
to be possible to explain classically it must be based on gravitational frequency shift by sending a signal from one 
gravitational potential level to another potential level and not by generating two independent signals at two potential 
levels. In fact, by adopting this condition we essentially make a distinction between gravitational time dilation and 
gravitational frequency shift as two different phenomena and hence the significance of the evidence as time dilation 
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explained classically and hence they are not conclusive as general relativistic evidence. In fact, they can 
also be explained non-classically by modern non-relativistic gravity theories. We should finally remark 
that in most of the time dilation (and frequency shift) experiments the frames of occurrence or/and 
observation are not fixed in space (e.g. they are freely falling as in the case of global positioning system or 
they follow certain spatial trajectory as in the case of Hafele-Keating experiment) and this should require 
generalization of the argument and derivation of the gravitational time dilation (and frequency shift) that 
we presented earlier. |25° 


Exercises 


1. What are the cases in which gravitational time dilation cannot be explained classically? 


9.3.1 Hafele-Keating Experiment 


This experiment, which was conducted in 1971 by Joseph Hafele and Richard Keating, is based on mea- 
suring the combined effect of time dilation caused by motion (i.e. special relativistic) and by gravity (i.e. 
general relativistic) where precise cesium atomic clocks carried on board airplanes were used to detect 
time dilation by performing round trips around the world eastward and westward and comparing their 
time with the time recorded by identical clocks that stayed stationary on the Earth. The principle of the 
experiment is very simple that is the traveling clocks on board the airplanes should experience slowing 
down (or speeding up depending on the direction of travel) due to kinematical (or special relativistic) 
time dilation effect caused by the motion, and speeding up due to gravitational (or general relativistic) 
time dilation effect caused by the increase in altitude. Accordingly, the experiment is a simultaneous test 
to both relativity theories in one go as it is based on testing the combined effect of time dilation predicted 
by these theories and not on testing the individual effects of these theories. 

However, the validity and significance of the results of this experiment may be questioned for several 
reasons. For example: 
e Since this experiment is based on testing the combined effect of the kinematical and gravitational time 
dilation it is not clear about the effect of each individual cause. Therefore, the result is not conclusive 
about the endorsement of each individual theory even if the observed combined effect agrees with the 
theoretically predicted combined effect (assuming the validity of superposition as will be discussed in the 
next point) because the actual contribution of the effect of each theory to the combined effect is unknown. 
e Since this experiment is based on testing the combined effect of the kinematical and gravitational time 
dilation, we need to assume the validity of superposition (or additivity of effects, i.e. the total effect is 
the sum of the special and general relativistic effects) which is not obvious (see Problems).!?57] 
e There are question marks about the validity of the involvement of special relativity and its application 
in this experiment and its analysis. For example, the analysis seems to suggest the need for an external 
(or privileged) frame to make sense of the claimed special relativistic effects where the change in time is 
positive in one direction and negative in the other direction. 
e The marginal error in the reported data is considerable and hence the results are not conclusive. 
e The results may also be explained partially by classical gravity as discussed earlier although this can 
be challenged since gravitational time dilation in this experiment is not based on gravitational frequency 
shift. The results may also be explained partially by other gravity theories. 


In fact, there are many sources of challenge and refutation for this experiment and its rationale and 
analysis as well as its procedural aspects (see Exercises). In brief, the results of this experiment are 
neither specific nor reliable and hence this test is not conclusive in its endorsement to special relativity or 
general relativity (let alone both). An improved version of this experiment was conducted later where it 
is reported that the marginal error was reduced substantially. However, this may have eliminated some 


evidence should vanish or diminish. 
[256] We note that the inclusion of special relativistic effects should not be sufficient to account for the required theoretical 
treatment of the gravitational effects in non-stationary frames unless we assume superposition (which is not obvious). 
[257] The reader should note that the validity of superposition (even if it is established) is not sufficient to address the concern 
in the previous point about the combined effect. 
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causes of concern in the previous experiment but not all causes and hence in our view this test is still 
inconclusive and potentially wrong. 


Problems 


1. Assess the validity of the commonly-accepted analysis of the observations and experiments (such as 

the Hafele-Keating experiment) that involve a combination of special and general relativistic effects 
(and potentially other effects). 
Answer: In our view, this analysis is questionable (at least in some cases) because it is implicitly based 
on the assumption of the validity of superposition!?5%! which is not obvious in these situations. For 
example, the formulation of gravitational time dilation is based on employing stationary gravitational 
frames and hence this formulation may not be valid in dynamic gravitational frames. Similarly, the 
applicability of the special relativistic formulation in the presence of gravity may not be valid. In fact, 
the uncertainty about the validity of superposition should introduce more sources of uncertainty on 
the validity of the tests that involve combinations of more than one effect (particularly special and 
general relativistic effects), i.e. apart from the uncertainty in endorsing the individual theories. 


Exercises 


1. Make a list of potential criticisms to the Hafele-Keating experiment and its analysis. 
2. Discuss briefly the “general relativistic version” of the twin paradox. 


9.3.2 Shapiro Time Delay Test 


The theoretical foundation of this test, which was theoreticized in 1964 by Irwin Shapiro, is based on the 
claim that due to the gravitational time dilation!?°"! an electromagnetic signal passing by a massive object 
should slow down due to its descent and ascent in a gravitational potential well. Accordingly, a radar 
signal sent from the Earth and reflected by another planet (e.g. Mercury or Venus) should experience a 
delay if the signal passed by the Sun in its onward-return journey. Such a test has been carried out in 
a number of experiments where it is claimed that the results match well with the predictions of general 
relativity. 

We note that the theoretical foundation of this test may seem inconsistent with the alleged constancy 
of the speed of light. However, this can be refuted by the claim that the constancy of the speed of light in 
general relativity is local. In fact, even the claim that this test is based on the gravitational time dilation 
effect may be challenged because it is inconsistent with the meaning of time dilation as used in other 
contexts and the controversy about the theoretical basis of this time delay (see the upcoming discussion). 
In our view, this test and its claimed theoretical foundation and formulation are shrouded with many 
question marks and the literature in this regard is rather messy and unreliable. Moreover, large part of 
the literature in this regard (especially on the experimental aspects) is based on trust with no scrutinized 
analysis or examination. So, even if such a delay is actually observed as reported, its theoretical foundation 
and formal justification require further assessment and analysis. The experienced delay may be explained 
by a general relativistic aspect (and potentially non general relativistic aspect) other than the proposed 
cause (e.g. gravitational time dilation) and the two are not necessarily equivalent in their significance and 
implications (see next paragraph). 

In this context, we note that there is confusion in the literature about the cause of this delay and if it is 
gravitational time dilation (as classified above) or gravitational length “dilation” (or what is described as 
spacetime dilation which may be stated by some as lengthening of the light path due to the curvature of 
spacetime) or dependency of the speed of light on the strength of the gravitational potential or gravitational 
light bending.?°°! Although some of these causes may provide equivalent explanations, the validity of 


258] We may also call it “additivity of effects”. 

259] As we will see, this is one potential explanation of the Shapiro time delay. We think the effect of gravitational length 
contraction (and possibly other effects) should also be considered in the estimation of the time delay if gravitational 
time dilation is supposed to be involved in this effect. However, a correct analysis based on the spacetime metric should 
automatically consider both temporal and spatial effects (as embedded in the spacetime metric). 

260] These alleged causes are what we found in the literature and there could be more. Although some of these explanations 
may reduce to others, it is not always the case. We note that the original derivation of the formula of this time delay 


9.3.2 Shapiro Time Delay Test 194 


each one of these causes should be demonstrated explicitly and formally to identify which aspect of general 
relativity is allegedly endorsed by this test and assess the value of this claimed evidence. We should also 
note that the confusion about the cause of this delay seems to be reflected in the confusion about its 
formulation, as will be seen in the questions. 


Problems 


1. Discuss the reliability of the Shapiro time delay test in its endorsement to general relativity. 
Answer: We believe this test is not conclusive. Some reasons for this belief are: 
e The theoretical foundation and formulation of this test are not well established. In fact, there are sev- 
eral methods and formulae that claim to represent Shapiro time delay and these methods and formulae 
do not produce the same predictions. For example: (a) some use coordinate variables while others use 
metrical variables, (b) some employ isotropic coordinates while others employ Schwarzschild coordi- 
nates, (c) some use straight path while others use deflected path, (d) some employ correction for the 
Earth movement while others do not. There are also disputes about its theoretical foundation (noting 
that the theoretical foundation should affect the formalism); some of these disputes are indicated in 
the text,!?64 
e There are question marks about the consideration of the speed of light and how it enters in the 
theoretical foundation of this time delay. We also question the potential role of gravitational length 
contraction (which is a result of having coordinate variables and metrical variables; see § 6.2 and § 6.4) 
and if it should be considered in the analysis. These issues are not addressed properly and consistently 
in the literature and this gives many degrees of freedom in manipulating the theoretical foundation 
to suit general relativity and its supporters. In brief, among this lack of clarity it is easy to select 
or tailor a formulation or explanation or method of calculation that suits the endorsement of general 
relativity even if this is inconsistent with the framework of general relativity or other established facts 
and principles. 
e There are many sources of uncertainty and error in the technique of radar ranging which is used 
in this sort of time delay experiments. This technique is complicated and challenging and hence the 
experimental results are not as straightforward as might be imagined. This criticism may be less severe 
with regard to other versions of the Shapiro time delay experiments where signals emitted from a satel- 
lite or spacecraft (e.g. Cassini probe) are used in these experiments and hence the time delay of the 
grazing signals can allegedly be obtained more directly than in the case of radar ranging experiments. 
However, there are still significant sources of error and uncertainty even in these versions. 
e The time delay caused by gravity is a very tiny fraction of the total journey time of the signal in the 
round trip and hence the effect of any error or noise will be so significant that the result will not be 
conclusive (especially when considering the above theoretical and practical uncertainties). 
e The formulae and calculations of this test are generally based on the use of physical variables for 
what is supposed to be coordinate variables and this could introduce significant error considering the 
extended spacetime interval of the onward-return journey even though the curvature of spacetime is 
small. 
e The reported (high) accuracy of the experiments of this test requires unrealistically high certainty 
about the distances involved and this is almost impossible to achieve in such dynamic orbiting systems. 
e Other effects (e.g. optical such as the refraction of the signal by the solar corona) are involved in 
these experiments and this should increase the uncertainty and error in the results (especially when 
considering the tininess of this time delay within the overall trip time). 


Exercises 


1. Give two formulae that allegedly represent Shapiro time delay and compare their quantitative predic- 
tions. 


by Shapiro is based on assuming the dependency of the speed of light on the strength of the gravitational potential in 
a Schwarzschild spacetime. 

[261] We should also note that there are technical reservations on the derivation methods of some of the time delay formulae 
(the details are out of scope). 
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2. Give some examples of expected sources of uncertainty and error in the Shapiro time delay test and 
assess their effect. 


9.3.3. Gravity Probe A 


This is a space mission conducted by the Smithsonian Astrophysical Observatory (SAO) in collaboration 
with the National Aeronautics and Space Administration (NASA) in 1976 to test gravitational time 
dilation where a probe carrying a hydrogen maser of very precise frequency was launched to a height of 
about 104 kml?®?! above the surface of the Earth. The working principle of this experiment is that due to 
the difference in the gravitational potential at the surface of the Earth and at 10+ km above the surface 
of the Earth the rate of time flow should differ (i.e. it is faster at 104 km above the surface of the Earth 
since it is higher in the potential well of the Earth) and this should be reflected in a change of frequency 
due to the reciprocal relation between the periodic time and frequency (see § 6.3.4). Accordingly, this 
may be seen as a test to the gravitational frequency shift as well as to the gravitational time dilation due 
to the fact that these two phenomena are based on the same physical principle (as explained in § 6.3.4) 
and hence they are essentially the same. 

It is claimed that the experimental results confirmed the predictions of general relativity to a high level 
of accuracy. However, even if we accept this claim in principle the evidence may not be conclusive in its 
endorsement to general relativity specifically since the results may be explained classically by gravitational 
frequency shift as a consequence of energy conservation (see § 9.4.3) although this explanation can be 
challenged because there is no actual frequency shift in this experiment but a generation of two independent 
signals at two gravitational potential levels (see § 8.4). We should also note that this experiment may 
be explained by non-classical gravity theories other than general relativity. In fact, it may be explained 
even by the equivalence principle without need for the full formulation of general relativity. Hence, it is 
not conclusive in its endorsement to the formulation of general relativity specifically and definitely. The 
reader is also referred to § 9.4.3 for further discussion and details about this experiment and related issues. 


Exercises 


1. What is the fractional change in the time interval between a clock on the surface of the Earth and 
a clock at an altitude of 10* km above the surface of the Earth (i.e. as in the Gravity Probe A 
experiment )?/?63] 


9.3.4 Global Positioning System 


Global positioning system (GPS) consists of a number of satellites positioned at high altitude (~ 20200 
km) around the Earth. The principle of operation of the GPS is based on satellite ranging where precise 
position and time signals are sent from the satellites and received by the users on the Earth to determine 
the position of the user (as well as other information like time and velocity). Accordingly, clocks on 
the satellites are used in conjunction with a triangulation-like method to determine the position on the 
surface of the Earth and in its proximity (e.g. in marine and aeronautical navigation). For the system 
to work reliably and determine positions accurately the clocks must be very precise to provide accurate 
and synchronized timing (and hence positioning) information.?°] Now, according to the gravitational 
time dilation of general relativity the clocks at high altitude run faster and hence regular corrections are 
needed to keep the timing accurate (i.e. synchronized with the timing on the surface of the Earth) and 
the system functioning. It is claimed that without the gravitational time dilation corrections of general 
relativity the GPS will become useless within a few minutes or hours due to the accumulation of timing 
errors and hence positioning errors. Accordingly, the GPS is regarded as a live evidence for the validity 


[262] Some sources in the literature report different height. However, the above figure seems more credible. 

[263] The question is restricted to the change caused by gravitational time dilation due to difference in height (or potential 
level) and hence it does not include other possible causes. 

[264] Ty talking about things like preciseness, accuracy, correction and synchronization we are essentially considering the 
timing on the surface of the Earth and the intended function of the GPS which is designed and deployed to serve its 
users on the Earth. Otherwise the satellite clocks are working properly within their physical environment without error 
or malfunctioning. 
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of general relativity. It should be remarked that special relativistic time corrections (due to kinematical 
time dilation) are also needed to keep the GPS working (see the upcoming discussion and Exercises). We 
should also expect other corrections related for example to the Shapiro time delay due to the descent of 
the satellite signals in the gravitational potential well of the Earth. 

Anyway, we have a number of reservations about the validity and significance of this test. For example, 
the test is supposed to involve both special and general relativistic effects (i.e. kinematical and gravita- 
tional time dilation) and hence it cannot be conclusive in endorsing any particular theory because we do 
not know what is the actual contribution of each theory (assuming that both theories have contribution) 
to the observed time difference between the satellite clock and the ground clock. In fact, the situation 
is more complicated due to the involvement of other effects and required corrections as indicated above. 
We also need to assume the applicability of superposition, i.e. the resultant effect of time change is the 
algebraic sum of the individual contributions of the effects of each theory (as if the other theory is not in 
action). The least that we can say about the validity of this assumption is that it is not obvious and hence 
it requires justification. Another reservation is that the applicability of special relativity is questionable 
because according to special relativity in its Einsteinian interpretation each clock (i.e. satellite clock and 
ground clock) should experience kinematical time dilation relative to the other clock. In fact, this should 
lead us to the controversy of twin paradox which was thoroughly investigated in B4 and hence we do not 
repeat the investigation here although we should mention that some of the alleged refutes of the twin 
paradox do not apply here (as indicated earlier in a similar context within the answer of some exercises). 
Moreover, the inertiality of the satellite and ground frames is questionable. We should also repeat what 
we stated before about the possibility of classical explanation for the gravitational time dilation based on 
the frequency shift and the conservation of energy although this can be challenged since what we actually 
have is two signals generated at two potential levels by two separate clocks. Anyway, alternative explana- 
tion may be provided by non-classical gravity theories other than general relativity even if we reject the 
classical explanation. The explanation may also be provided by the equivalence principle without need 
for the full formulation of general relativity. 

So in brief, we believe that this test is invalid or at least it is inconclusive. We can even claim that this 
test can be an evidence against at least one of the relativity theories, i.e. it can be an evidence against 
special relativity since time dilation takes place in a privileged frame (i.e. satellite frame) and not in the 
other frame (i.e. ground frame) which is a clear violation of the relativity postulate of special relativity in 
its Einsteinian sense. In fact, the collapse of this evidence in its special relativistic aspect should lead to 
its collapse in its general relativistic aspect because the observed combined time dilation effect (which is 
claimed to match the combined predictions of these theories) depends on the involvement of both theories 
in this observed combined effect and hence the collapse of the contribution of one theory to this combined 
effect should lead to the collapse of the contribution of the other theory. 

Finally, we should remark that there are claims in the literature that the time corrections in the GPS 
are based on empirical (rather than relativistic) formulae and the two are not identical. If so, then the 
global positioning system could lose its status in endorsing the relativity theories and could even be an 
evidence against the relativity theories. Unfortunately, we are not in a position to confirm any one of 
these conflicting claims because we have no detailed knowledge about the design and operation of the 
global positioning system. Hence, at the best the validity of this test (like most other tests) is a matter 
of trust and faith (assuming that its theoretical basis is sound). 


Problems 


1. Give a brief account of how the global positioning system works. 
Answer: The satellites of the global positioning system regularly transmit electromagnetic signals 
carrying information about their time and position in space. A receiver on the surface of the Earth (or 
in its surrounding such as airplane) uses the information from a number of these satellites. Hence, by 
knowing the location of the transmitting satellites and the time required by the signals to reach the 
receiver it can work out its position relative to the satellites and from this the position of the receiver 
on the Earth can be determined since the position of the satellites relative to the Earth is known. In 
brief, the satellites of the global positioning system work as reference points (or markers) in the space 
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to facilitate the determination of the position of the receiver on the Earth through the information 
about their position and the time required by the transmitted signals to reach the receiver. 


Exercises 


1. Calculate the general relativistic time correction that is required to keep the GPS functioning.|?°! 

2. Calculate the special relativistic time correction that is required to keep the GPS functioning. Comment 
on the method of calculation. 

3. Considering the results of the previous two exercises, find the required combined correction due to 
relativistic effects.|?°6 


9.4 Gravitational Frequency Shift 


Gravitational frequency shift may also be considered as one of the compelling evidence in support of general 
relativity. In the following subsections we briefly investigate a number of experiments and observations 
that allegedly verify the gravitational frequency shift of general relativity. However, before that we should 
draw the attention to the following remarks: 

e Gravitational frequency shift in general relativity is based on gravitational time dilation (see § 6.3.4) and 
hence it is not an independent evidence. This should diminish the value of this evidence since in essence 
it does not support an aspect of general relativity other than gravitational time dilation. In brief, all the 
claimed evidence in support of gravitational frequency shift are no more than instances of gravitational 
time dilation (and hence they do not represent a different type of evidence). The reverse is also true in 
general (or at least in most cases) due to the general use of frequency for timing because timing is usually 
based on using repetitive frequent physical phenomena like pendulum swing and atomic emissions.|?°7! 

e As we saw in § 8.4, the classical and general relativistic formulations of gravitational frequency shift 
are different but their quantitative predictions are very similar in most cases!?68l and this should cast 
a shadow over at least some of the claimed gravitational frequency shift evidence in support of general 
relativity because these predictions cannot be distinguished practically from the classical predictions and 
hence the evidence is not conclusive in its endorsement to general relativity specifically since it can be 
justified classically (noting also that it may be justified by other modern gravity theories). In fact, this 
should also cast a shadow over at least some of the claimed gravitational time dilation evidence in support 
of general relativity because of the close connection between gravitational time dilation and gravitational 
frequency shift (as explained in the previous remark and in numerous places in the previous sections and 
chapters) and hence most (if not all) of the time dilation evidence may be regarded as frequency shift 
evidence and vice versa (although this may be disputed at least in some cases). 

e Based on the previous remark and referring to our discussion in § 8.4 (see exercise 5 of § 8.4), if en <1 
then the gravitational frequency shift formula of general relativity at infinity (i.e. rg — co) is practically 
indistinguishable from its classical counterpart. In fact, this is the case in most astronomical observations 
of gravitational frequency shift. Therefore, for the gravitational frequency shift evidence in astronomical 
observations to be potentially conclusive in its endorsement to general relativity specifically the evidence 
should belong to a situation where the condition en <1 does not apply (by considering black holes and 
possibly neutron stars). 

e We should also note that the gravitational frequency shift is essentially a test for the equivalence principle 
rather than the Field Equation and this view is expressed by some general relativists. For example, 
according to Rindler (see the References) “The well-established gravitational Doppler shift, occasionally 


In fact, the question is about the main general relativistic correction caused by gravitational time dilation due to 
difference in height or potential level (noting that other general relativistic corrections are required). 

In fact, the question is about the main (not all) relativistic effects (i.e. the effects investigated in the previous exercises). 
We note that timing may be achieved by non-repetitive processes such as assigning (according to Newton’s first law) 
equal time periods to equal distances traversed by free particles (although the non-repetitivity of this process may be 
questioned as well as the axiomatic foundation of this timing technique). In fact, there are many details and potential 
debates and controversies about these issues (which we do not investigate). 

In fact, it was shown in § 8.4 that power series approximations of the classical and general relativistic formulations (plus 
reasonable conditions and assumptions which are valid in most cases) produce identical formulae. 
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also referred to as a crucial effect, is, in fact, not a test of the field equation but merely of the equivalence 
principle - at least in lowest order, which is all that can be observed at present”. In fact, there are many 
interesting details about this issue which the inquisitive reader may wish to investigate (see for Example 
Weinberg and Rindler in the References). 


Exercises 


1. Justify the fact that the correlation between gravitational time dilation and gravitational frequency 
shift should diminish the value of the evidence related to these effects in endorsing general relativity. 

2. What is the significance of having a classical formulation for the gravitational frequency shift that is 
identical to the general relativistic formulation under a power series approximation that is acceptable 
in most cases? 

3. If gravitational time dilation and gravitational frequency shift are equivalent then why should we 
investigate these effects (in this chapter and in the previous chapters) separately? 

4. Discuss the conventional nature of the physical definition and determination of time. 


9.4.1 Gravitational Red Shift from Astronomical Observations 


Red shift of electromagnetic emissions from astronomical objects, such as white dwarfs, has been observed 
regularly in astronomical investigations and it is commonly attributed in many cases to gravitational cause 
in accord with the prediction of general relativity. Some of these claimed instances of gravitational red 
shift go back to the 1920s where a spectral line from the companion of Sirius was observed red shifted 
and this was explained gravitationally and hence it was regarded as endorsement to general relativity 
(although this was discredited later). There are many other subsequent observations that are similarly 
explained and hence they are alleged to endorse general relativity. 

However, in many cases this evidence is not conclusive due to the many uncertainties in these obser- 
vations and in the physical characteristics of the emitting objects where these characteristics (such as 
mass and radius) enter in the estimation of the gravitational red shift. In fact, some of these claimed 
instances can be circular in their indication since the observed emissions are used to infer the physical 
characteristics of the emitting objects and hence these characteristics (even though they may be obtained 
from physically-similar but different objects) cannot be used in the estimation of the gravitational red 
shift. 

In brief, we have several reservations on the validity and conclusivity of the gravitational red shift in 
astronomical observations as an endorsing evidence to general relativity; these reservations include: 

e There are many observational and theoretical sources of uncertainty and error in these observations and 
their theoretical foundation and methods of analysis (as it is the case in most astronomical observations). 
One demonstration of this fact is the contradiction between some observations and other similar obser- 
vations (refer for example to the huge difference in the Sirius observations and the large uncertainties in 
the results as reported in the literature). 

e Some of these observations may be explained by non-gravitational causes (e.g. kinematical such as 
Doppler shift or moderation by interstellar medium). In fact, such a possibility should ruin the significance 
of many claimed instances of this evidence even if we accept in principle the theoretical foundation of the 
general relativistic gravitational red shift. 

e Most of these observations (assuming they are certain and are of gravitational origin) can also be 
explained by the classical formulation of the gravitational red shift (as discussed earlier) and hence they 
are not definite in their endorsement to general relativity. These observations can also be explained by 
metric gravity theories other than general relativity (and possibly even by the equivalence principle with 
no need for the formalism of general relativity). 


9.4.2 Pound-Rebka Experiment 


The essence of this experiment, which was conducted in 1959 by Robert Pound and Glen Rebka, is to 
measure the frequency shift experienced by gamma rays traveling between the top and bottom of a tower 
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at Harvard University over a distance of about 22.5 meters in the gravitational field of the Earth.!?°! It is 
reported that this experiment verified the prediction of general relativity (i.e. blue shift in the descending 
rays and red shift in the ascending rays) to high level of accuracy.!?7°! Hence, this experiment is regarded 
as one of the most conclusive evidence in support of the gravitational frequency shift and the theory of 
general relativity. 

However, as we will see in the Exercises the classical formulation of the gravitational frequency shift 
provides very similar quantitative prediction to the prediction of general relativistic formulation and hence 
the results of the Pound-Rebka experiment are not conclusive in endorsing general relativity specifically 
since the results can be similarly explained by the classical formulation. Moreover, the results can also be 
explained by other metric gravity theories (and possibly even by the equivalence principle without need 
for the full formulation of general relativity). 


Exercises 


1. Derive a simple classical formula for the expected frequency shift in the Pound-Rebka experiment. 

2. Referring to the answer of the previous exercise, justify why high-frequency radiation (like gamma ray) 
should be desirable to use in experiments like the Pound-Rebka experiment. 

3. Use the available data about the Pound-Rebka experiment with the classical formula that was derived 
in exercise 1 to estimate the frequency shift and the fractional frequency shift in the Pound-Rebka 
experiment. 

4. Repeat exercise 3 using this time the general relativistic formula for the gravitational frequency shift 
that was derived in § 6.3.4. Comment on the results. 

5. Repeat exercise 3 using this time the classical formula for the gravitational frequency shift that was 
derived in § 8.4. Comment on the results. 


9.4.3 Gravity Probe A 


This test was investigated earlier in § 9.3.3 from a gravitational time dilation perspective. However, we 
briefly re-discuss this test here from a gravitational frequency shift perspective to show that the results of 
this test may not be as significant in endorsing general relativity as it might be thought because similar 
quantitative results can be obtained from classical formulation (see Problems). In fact, this test may 
not be really a frequency shift test and hence the classical formulation (which may be restricted to the 
case of actual frequency shift) may not apply because as explained earlier we have in this experiment two 
independent signals generated at two different gravitational potential levels (i.e. we do not have a single 
signal that descends or ascends in a gravitational well) and hence the classical argument of frequency 
shift (which is basically based on the conservation of energy during the transition of a signal between two 
potential levels) may not be valid. This issue has already been investigated rather thoroughly (see for 
example § 8.4) and hence we will not discuss it anymore. 


Problems 


1. Estimate the gravitational frequency shift in the Gravity Probe A experiment using classical and 
general relativistic formulations. Comment on the results. 
Answer: We subscript the variables on the surface of the Earth with 1 and the variables at 10* km 
above the surface of the Earth with 2. In the following we use the fractional frequency shift in our 
estimation. 
According to the classical formula that was derived in § 8.4 we have: 
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[269] The Pound-Rebka experiment was more elaborate than what our description might suggest. The details (which are 
of little value to our purpose) can be found in the literature of general relativity. We should remark that there are 
some reservations on some aspects of the Pound-Rebka experiment other than being justifiable by other theories (and 
classically in particular). However, we see no necessity in going through these lengthy details. 

[270] Th fact, the accuracy of the original Pound-Rebka experiment was rather poor. But it was later improved substantially 
by Pound and Snider according to the reports. 
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According to the general relativistic formula that was derived in § 6.3.4 we have (noting that we use 
physical quantities for what is supposed to be coordinate quantities): 
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Comment: as we see, the classical and general relativistic results are identical (to the quoted ac- 
curacy).[?74] So, from a gravitational frequency shift perspective the results of this test are not as 
significant in endorsing general relativity as it might be thought because from this perspective the 
quantitative prediction of the general relativistic formulation can be obtained from the classical for- 
mulation. However, the applicability of the classical formulation in this experiment may be questioned 
(as explained earlier). 


9.5 Gravitational Length Contraction 


We are not aware of any alleged evidence in support of gravitational length contraction of general rela- 
tivity. One reason could be the alleged impossibility of observing length contraction (assuming it does 
exist). However, the proposed reasons and arguments in this regard (or at least their generality) are 
largely questionable and the investigations and discussions about this important issue are rather terse 
and superficial. In fact, there is even some mess and confusion in this part of the literature of general 
relativity which may indicate a crack in the theory. Anyway, even if this does not indicate a crack in the 
theory it may indicate that there are some aspects in the theory that are not well developed or may not 
be properly examined and assessed and this should put a question mark on the endorsement of the theory. 
Accordingly, it should be premature to assert that the theory of general relativity is endorsed even if all 


[271] In fact, they are also identical with the result of exercise 1 of § 9.3.3 which is about time dilation (as it should be). 
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the claimed evidence are valid and conclusive because these evidence are limited to certain aspects of 
the theory and hence many important aspects of the theory (such as length contraction) are still waiting 
verification. 

We may also claim that the existence of evidence in support of gravitational time dilation, but not 
gravitational length contraction, in addition to the existence of a classical formulation that can explain 
gravitational time dilation (i.e. indirectly through frequency shift) at least in some cases and circumstances 
may give the classical interpretation of these results more weight. In other words, we may not actually have 
gravitational time dilation (because if we have then we should also have gravitational length contraction) 
and hence all the effects that are attributed to time dilation may be explained by other causes such as 
frequency shift and conservation of energy. 


Exercises 


1. Try to justify the claim that observing gravitational length contraction is not as easy as observing 
gravitational time dilation (and it may even be impossible). 
2. What is the important issue that is highlighted by the previous exercise? 


9.6 Gravitational Waves 


There are claims of direct and indirect observation and detection of gravitational waves and the research 
in this field is going on vigorously. However, unequivocal confirmation is still sought despite the recent 
claims that definitely confirm the detection of gravitational waves from cosmic cataclysms (e.g. the 
collision of two black holes or the coalescence of two neutron stars) using laser interferometric techniques. 
We should remark that in principle gravitational waves may also be deduced from other gravitational 
theories (including classically-based theories) and hence they do not make a decisive evidence in support 
of general relativity unless the observations quantitatively agree with the predictions of general relativity 
and disagree with the predictions of other theories. So, up to the present time gravitational waves (even 
if they are really detected) cannot be regarded as a conclusive evidence in support of general relativity. In 
fact, gravitational waves should be a natural consequence of any full field theory with temporal and spatial 
dependency and hence gravitational waves, even if confirmed experimentally, should not be considered a 
decisive evidence in endorsing general relativity specifically but they should be seen as evidence for the 
tempo-spatial field model of gravitation. 

We should also note that there is a more fundamental challenge to the significance and value of grav- 
itational waves as evidence for general relativity because the current general relativistic theory of grav- 
itational waves is largely based on the linearized form of general relativity (see § 3.4) in the weak field 
approximation and hence there are theoretical objections to the endorsement of gravitational waves to the 
full non-linearized form of the theory.?77! In fact, on inspecting the literature of gravitational waves we 
can conclude that the general relativistic formulation of gravitational waves is theoretically problematic 
from multiple aspects (whether linear or non-linear). Hence, the available approaches for developing for- 
mulations for this phenomenon cannot lend full support to general relativity (even if these formulations 
are vindicated by observations) because these formulations are not entirely, genuinely and surely general 
relativistic since they are based on many approximations, twists and controversies. 

There are two main methods for verifying gravitational waves: indirect and direct. These methods are 
investigated and assessed in the following two subsections. 


9.6.1 Indirect Observation of Gravitational Waves 


Until recently, this was the only method for the detection of gravitational waves. Indirect observation is 
based on the principle of conservation of energy in conjunction with the relation between the periodic 
time of orbiting system and its total energy. According to the basic theory of this test, a binary orbiting 


[272] We should note that the recent research literature on the theory of gravitational waves includes claims of non-linear 
solutions of the relativistic gravitational waves problem. However, there are many ambiguities, obscurities and contro- 
versies about this issue and hence we can say that the currently-accepted theory of gravitational waves is based on the 
linearized general relativity. 
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systeml?73] of two massive objects (e.g. neutron stars) which loses energy by the emission of gravitational 
waves, should change its orbital period continuously (see Exercises). The famous (and possibly the only) 
indirect test of gravitational waves by this method is related to the PSR B1913+16 binary pulsar system 
which consists of two neutron stars of approximately equal mass of about 1.4 solar masses. This binary 
system was discovered in 1974 by Hulse and Taylor and was subsequently investigated by them thoroughly. 
According to the results of this investigation, which is largely based on a general relativistic framework, 
the orbital period of the system should decrease by about 70 pus per year?”4] due to the loss of energy by 
the emission of gravitational waves in excellent agreement with the observations (the reported uncertainty 
is less than 1%). 

We think the significance of this test in endorsing general relativity is highly exaggerated in the literature 
of general relativity. For example, the current general relativistic theory of gravitational waves is based 
on the linearized form in the weak field approximation and the applicability of this approximation in this 
case as well as its endorsement to the non-linearized form can be strongly challenged. We should also 
mention that there are many sources of theoretical and observational uncertainties (as well as potential 
circularity) in quantifying this binary system and extracting the reported results. Moreover, these results 
are partly based on some classical formulations and presumptions where the applicability of some of these 
should be questioned (because of the strong gravity of this system and/or because of the accumulation of 
relativistic effects due to violation of tempo-spatial locality). We should also consider the fact that being 
an indirect test in itself should diminish the value of this test as a decisive evidence for gravitational waves 
due to the possibility of the involvement of mechanisms other than gravitational waves in the shrinkage 
of the orbit and the decrease of the orbital period. In fact, this alleged indirect observation should at 
the best be regarded as circumstantial evidence. We therefore believe that this test is at least not as 
conclusive as it is depicted by general relativists. 


Exercises 


1. Why should the emission of gravitational waves by a binary orbiting system shorten the orbital 
period? 

2. Assess the claimed evidence of indirect observation of gravitational waves through the detection of 
changes in the orbital period of binary system. 


9.6.2. Direct Observation of Gravitational Waves 


Direct observation of gravitational waves is supposedly achieved (or expected to be achieved) by using 
gravitational wave detectors like LIGO (which is an acronym for: Laser Interferometer Gravitational- 
wave Observatory) and LISA (which is an acronym for: Laser Interferometer Space Antenna). In this 
context, we should also mention Virgo which is an Italian interferometer (based near the city of Pisa) 
that is dedicated to gravitational wave detection and it collaborates with LIGO in gravitational wave 
observations. These detectors employ Michelson-like laser interferometric techniques. The principle of 
operation of these detectors is that as the gravitational waves pass by the detector they cause changes 
in the geometry of spacetime in its vicinity and hence these waves introduce differential changes on the 
length of the arms of the laser interferometer which cause a measurable change in the interference pattern 
and its detectable effects. 

We should remark that the above-described method of detection by laser interferometric techniques is 
one of the two main methods for direct observation of gravitational waves. This technique belongs to 
what can be generically described as “free particle’ method of detection. The other main direct detection 
technique (which belongs to what can be generically described as “elastic body” method of detection) is 
based on the use of hefty metal bars equipped with very sensitive sensors to detect minute changes in the 
bar dimensions caused by the passage of gravitational waves. However, the latter detection technique is not 
sensitive enough to be of significant practical value (at least for the time being) in detecting gravitational 


[273] Being binary is not necessary, but it is the case in the system(s) used in this test. Anyway, it is a legitimate example 
and hence it is sufficient to demonstrate the principles. 
[274] We note that the rate of change should change with time. 
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waves of common origin and hence no credible claim of detection of gravitational waves by this method 
has been made yet. 


Problems 


1. Assess gravitational waves as evidence in support of general relativity. 
Answer: In assessing gravitational waves as evidence in support of general relativity we note that 
there are two aspects of this test: a qualitative aspect which is the mere existence of gravitational 
waves, and a quantitative aspect which is the measurement of the properties of these waves such as 
their amplitude and frequency. 
Regarding the qualitative aspect, we note that the endorsing value of this aspect is very limited because 
as we pointed out earlier the existence of gravitational waves can be accommodated within theories 
other than general relativity and it was a subject of contemplation and investigation even in classical 
physics (see § 8.6). So, it is not specific to general relativity and hence it is not conclusive in its 
endorsement to the theory. In fact, any tempo-spatial gravity theory that accepts the finity of the 
speed of propagation of gravitational interactions should imply the existence of gravitational waves (in 
some sense). 
Regarding the quantitative aspect, we note that although the endorsing value of this aspect is significant 
and could be decisive in endorsing one theory or another, there is no conclusive evidence in support of 
general relativity from this aspect. The reason is that indirect observations have no quantitative aspect 
from this perspectivel?75l (since there is no actual detection of waves to measure their properties and 
see if they agree with the predictions of general relativity or not) while direct observations have many 
uncertainties from a quantitative perspective and hence they cannot determine beyond any doubt if 
the observations agree quantitatively with the predictions of general relativity or not. Moreover, some 
metric gravity theories can also provide similar quantitative predictions. 
In brief, gravitational waves (whether allegedly observed directly or indirectly) is at the best a qualita- 
tive evidence, moreover it is not specific in endorsing general relativity since it can also be explained by 
other theories. Furthermore, its endorsement to the full general relativistic formulation (rather than 
linear or semi-classical formulation) is questionable. 


Exercises 


1. Provide a brief description of LIGO. Also, list some of the alleged observations of gravitational waves 
made by LIGO. 

2. Provide a brief description of LISA. 

Assess the recent claims of detecting gravitational waves by LIGO. 

4, Assess the value of the experimental projects (such as LIGO and LISA) about the detection and 
observation of gravitational waves. 

5. Discuss briefly the “elastic body” method for the direct detection of gravitational waves. 


. 


9.7 Black Holes 


In our view, black holes are not a conclusive evidence in support of general relativity. In the following 
points we outline some of our reservations and observations about this alleged evidence: 

e Although the existence of black holes is advocated by some physicists as a scientific fact and may be 
regarded as one of the strong evidence in support of general relativity, their existence is not confirmed 
beyond any doubt. Moreover, even if the existence of black holes is confirmed this does not apply to 
many details about their existence and their physical structure and properties. For example, there may 
be evidence for the existence of stellar black holes and super-massive black holes but there is no evidence 
for the existence of primordial black holes. Also, there is no evidence for the theoretical models of their 
structure (e.g. singularity surrounded by event horizon and photon-sphere, etc.). In fact, there are many 
question marks on many details about black holes in general, the claimed evidence on their existence, 
their types and their alleged physical structure and properties. Most parts of the physics of black holes 


[275] In fact, even the qualitative aspect of indirect observations is generally contemplative and has no decisive value. 
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are based on pure theoretical models (which are generally based on general relativity) and the existence of 
black holes does not imply that their detailed physics follows these models. Moreover, some of the claimed 
evidence is motivated by the search for evidence in support of these theoretical models and predictions 
and hence there are many sources of bias and lack of objectivity and impartiality. Therefore, we should 
be cautious about the physics of black holes and the claimed evidence on their existence and the alleged 
circumstantial evidence in support of some details of their physics. 

e If there is a conclusive evidence about the existence of black holes then their existence can be explained 
even by classical gravity, as well as by other gravity theories, and hence they are not conclusive evidence 
for general relativity. Yes, there are certain consequences and implications about black holes that are 
specifically derived from general relativity and hence they may provide supporting evidence for general 
relativity. However, none of these consequences and implications have been proved definitely by experiment 
or observation and therefore we see no evidence from the physics of black holes to support general relativity 
specifically and definitely. 


Exercises 


1. Assess black holes as evidence for general relativity. 
2. Assuming Hawking radiation, calculate the power radiated by a black hole of one Earth mass and 
comment on the result. 


9.8 Geodetic Effect 


In this section we investigate two claimed evidence in support of geodetic effect of general relativity (see 
the subsections of this section). Apart from the feebleness of these claimed evidence from a practical 
perspective, the claimed evidence may also be challenged theoretically on the basis of lack of meaningful 
physical significance of momentum (and hence its conservation) on global scale within the framework of 
general relativity. This should cast a shadow on the rationale and significance of the implications of these 
claimed evidence. However, the challenge may be questioned by the reliance of this effect on its local 
significance as demanded by parallel transport (although even this can be challenged by questioning the 
value and significance of parallel transport). We may also add another theoretical reservation that is 
according to general relativity spinning objects deviate in their trajectories from the geodesic paths of 
non-spinning objects and this should introduce an error into the general relativistic formulation of geodetic 
effect which is based on assuming geodesic trajectories. However, this deviation (and hence the error) is 
negligible when the spinning is not excessive (which presumably is the case in the claimed evidence). We 
should also note that the calculations in the experiments of this test (see the subsections of this section) 
seem to be based on ignoring the difference between physical variables and coordinate variables which is 
important in this case even though the gravitational field is weak and the spacetime is almost flat. This is 
due to the violation of locality and the accumulation of tiny relativistic effects over extended tempo-spatial 
interval (similar to the situation in the case of perihelion precession; see § 8.1 and § 9.1). In fact, the 
violation of locality should magnify even the effect of the aforementioned “deviation from the geodesic 
paths due to spinning” and make it significant. 


9.8.1 Precession of “Moon-Earth Gyroscope” in Motion around Sun 


This is an effect predicted by de Sitter in his analysis of the revolution of the “Moon-Earth gyroscope” 
around the Sun. It is claimed in the literature that this effect is verified in a number of experiments 
using radio interferometry and laser ranging techniques and the accuracy of the agreement between the 
experimental results and the predictions of general relativity ranges between 10% and 1%. However, there 
are many question marks and uncertainties about these experiments and associated claims as well as the 
insufficiency of the reported accuracy (at least in some cases) to establish the claimed proposition firmly 
and definitely. Anyway, the general challenges to geodetic effect (as outlined in § 9.8) should be sufficient 
to question the significance of this test in endorsing general relativity. 
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9.8.2 Gravity Probe B 


This is a space experiment in which 4 gyroscopes on board a satellite orbiting the Earth in a polar orbit 
are used to test the general relativistic prediction of geodetic effect (as well as frame dragging which 
will be investigated in § 9.9.2). Despite the common claims that the experiment confirmed the general 
relativistic prediction of geodetic effect to high level of accuracy (about 1% and later reduced to about 
0.5%), there are serious suspicions and challenges to these claims where some respected physicists disputed 
the accuracy of this experiment (and perhaps even some of its theoretical aspects). We refer the interested 
reader to the literature about this issue. Anyway, the general challenges (as outlined in § 9.8) should be 
enough to question the endorsing value of this test. 


9.9 Frame Dragging 


Before we go through the claimed evidence in support of frame dragging, we should note that there are 
conceptual and theoretical difficulties about the significance of frame dragging and the interpretation of 
its formalism as well as its relation to the Mach principle and geodesic motion. Some of these issues (at 
least) should cast a shadow on the significance and value of some of the claimed evidence in support of 
frame dragging and its general relativistic foundations. The interested reader should refer to the literature 
of frame dragging for details. We should also note that the calculations in the experiments of this test (see 
the subsections of this section) are apparently based on ignoring the difference between physical variables 
and coordinate variables which is important in this case even though the gravitational field is weak and the 
spacetime is virtually flat. This is due to the violation of locality and the accumulation of tiny relativistic 
effects over extended tempo-spatial interval (similar to the situation in the case of perihelion precession; 
see § 8.1 and § 9.1). Anyway, as we will see there is no credible evidence in support of frame dragging. 


9.9.1 LAGEOS Satellites 


LAGEOS satellites (which stands for LAser GEOdynamics Satellites) are simple spheres covered in retro- 
reflectors. These satellites are used for laser ranging from Earth-based stations. It is claimed that the 
orbital planes of these satellites are observed to change in accordance with the predictions of frame dragging 
of general relativity. However, these claims are highly disputed because even the lowest estimates of the 
errors in these observations are too high to be conclusive and hence this test was dismissed even by loyal 
general relativists. 


9.9.2 Gravity Probe B 


Apart from the suspicions about the results of the Gravity Probe B experiment in general, the frame 
dragging effect in this experiment is reported to be below the noise level and hence the result of frame 
dragging is inconclusive even according to the supporters of this experiment.!?76l Also, it may be claimed 
that if the two effects (i.e. geodetic effect and frame dragging) in the Gravity Probe B experiment are 
mixed together (as we expect) then the value of the entire experiment could become questionable in 
endorsing any particular effect. However, we should note that the total gyroscopic shift is made of two 
components: a component in the polar orbital plane due to geodetic effect, and a component in the 
equatorial plane (assuming the gyroscope spin points initially in a direction parallel to this plane) due to 
frame dragging effect (or Lense-Thirring effect) and hence these components are perpendicular and should 
be distinguishable from each other. Nevertheless, the uncertainty in one of these components could still 
affect the uncertainty in the other component. In fact, we may require the validity of superposition 
(or additivity of effects) as well to justify the analysis of the results of this experiment (although this 
requirement may be questioned for the same reason, i.e. being perpendicular and distinct). Nevertheless, 
the potential requirement of superposition could cast a shadow on the combined effect and hence on the 
confidence in the results of the individual effects. 


[276] To the best of our knowledge, the best estimate of error in the frame dragging effect of this experiment is 15%. The 
interested reader should refer to the literature for details. 
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9.10 Wormholes and Other Fantasies 


So far, no discovery of a wormhole (or one of the other fantasies) has been announced. However, we 
should not be surprised if such an announcement is made in the future thanks to the enthusiastic race in 
endorsing general relativity and the general attitude among physicists toward its validity. 


9.11 Cosmological Predictions 


As indicated earlier (see for example § 8.11), the cosmological consequences and predictions that are 
extracted and formulated according to the general relativistic framework are largely based on speculations 
and contemplations as well as philosophical and metaphysical ideas and hence we do not expect any genuine 
confirmation of the majority of these consequences and predictions. In fact, some of these consequences 
and predictions are outside the realm of physical science and hence they are not scientifically verifiable 
(e.g. many details related to the development of the alleged Big Bang). However, no one should be 
surprised if alleged confirmations to general relativity from these cosmological predictions emerged in the 
future. In fact, even now we can find some claims of cosmological evidence in support of general relativity 
by some enthusiastic relativists. 


9.12 Tests of the Equivalence Principle 


Experimental tests of the weak equivalence principle may be considered as indirect evidence in support 
of general relativity. However, most (if not all) these tests are actually tests to the classical equivalence 
principle (which is undisputed) and hence they have no significance or value in endorsing general relativity. 
In fact, from a practical perspective the classical equivalence principle contains all the actual physical 
content of the weak equivalence principle and hence the classical equivalence principle should be able 
to explain any alleged physical consequences of the weak equivalence principle. In other words, the 
value added by the weak equivalence principle to the classical equivalence principle is no more than an 
unverifiable theoretical claim. This should be supported by what we saw earlier (refer to § 1.8.2) that even 
some general relativists challenge the validity of the weak equivalence principle and this has no meaning 
other than challenging its theoretical implication and content because no one can question the validity of 
the classical equivalence principle. 

Anyway, even if we accept that these tests are evidence for the weak equivalence principle they are not 
specific to general relativity as there are many gravitational theories that rest on this principle. In other 
words, the validity of the weak equivalence principle is not an evidence for the validity of the formalism 
of general relativity specifically (although it may be claimed to be an evidence for the validity of the 
family of theories that rest on this principle). In fact, the validity of the weak equivalence principle is not 
evidence for the validity of a specific theory that is based on this principle (whether this theory is general 
relativity or another theory in that family) or even the validity of that family in general because wrong 
theories can rest on correct and valid principles, e.g. a wrong maximization (rather than minimization) 
theory may rest on a valid variational principle. 

In any case, there is no evidence in support of the strong equivalence principle (and possibly there is 
evidence against it as claimed by some and as will be seen later) despite some unreliable claims, which 
are mostly based on theoretical arguments, about its validity. In fact, what is critically important and 
significant to general relativity is the strong equivalence principle (which alleviates the theory from being 
just a gravity theory to become a “General Theory”) because the weak equivalence principle is essentially 
similar to the classical equivalence principle (which is unquestionable) with some unverifiable and arguable 
theoretical extensions as we explained earlier. 


Exercises 


1. Assess the value of the equivalence principle tests as endorsing evidence to general relativity. 
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9.13 Tests of Special Relativity 


As we noted earlier in the preamble of this chapter, the tests of special relativity are considered by some 
to be tests of general relativity. This may be justified by the claim that “special relativity holds true 
locally in the spacetime of general relativity” or by the claim that “special relativity is a special case of 
general relativity or contained in it”. However, this may be challenged by the following: 

e The locality condition deprives these tests from any value to general relativity as such because as long 
as we are assuming that we are within the domain of special relativity (even if this is only at local level) 
then the spacetime is flat and hence there is no significance of such evidence to general relativity as a 
theory for the entirety of the curved spacetime. This means that any implication or effect that is specific 
to general relativity cannot be obtained from such tests and hence it cannot be endorsed by these tests. In 
other words, these tests have no general relativistic content and hence they cannot be supporting evidence 
to general relativity as such, so they are only tests to special relativity (whether independently or within 
the general relativity at local level). 

e Based on the previous point, any claim of endorsing general relativity by these tests implies duplication 
since this means endorsing special relativity by these tests once independently and once as a local appli- 
cation to general relativity. In other words, we are using the same evidence to prove the same premise 
twice and this should have no added value because it is mere repetition. 

e The premise that “special relativity holds true locally in the spacetime of general relativity” has only 
theorized value with no real physical content. In other words, the originator of the theory of general 
relativity arranged the theory in such a way that special relativity holds true locally without deriving 
such a premise from the formalism of general relativity as represented by the Field Equations. To be 
more clear, we have a general-relativity-specific formalism represented by the Field Equations (and their 
direct consequences) and a special-relativity-specific formalism represented by the Lorentz transformations 
(and their direct consequences) and these two formalisms are independent of each other apart from the 
theorized connection (which is rather arbitrary) that is established by the originator of the theory of general 
relativity. The situation will obviously be different if this connection was a natural connection (i.e. by 
the convergence of the general relativistic formalism to the special relativistic formalism). Accordingly, 
even if the above premise is true it does not prove anything about general relativity as a gravity theory 
as represented by the Field Equations. 

e The claim that “special relativity is a special case of general relativity or contained in it” is disputed 
(see § 1.7 and refer to B4). In fact, this claim can be challenged and rebutted. Moreover, the implications 
of this premise should depend on the meaning of “special case” or “contained” and this is not obvious or 
agreed upon. In fact, we may even claim that, according to some opinions and interpretations about the 
relation between special relativity and general relativity, the use of the special relativistic tests to endorse 
general relativity is circular since their validity as general relativistic tests depends on their validity as 
special relativistic tests while their validity as special relativistic tests is supposed to be a requirement (or 
special case) of general relativity. 

In brief, there is no relation between general relativity as a gravity theory and special relativity and 
hence no test for one theory can be seen as a test for the other. Yes, there is a relation between general 
relativity as a “General Theory” and special relativity (due to the claim of local application of special 
relativity in the spacetime of general relativity). However, the physical setting in the two theories is 
different (i.e. global inertial frames of flat spacetime in special relativity, and local non-inertial frames of 
curved spacetime which are allegedly equivalent to inertial frames in general relativity) and hence the test 
for each theory should belong to that theory alone and cannot be considered a test for the other theory. 
Yes, if we ignore the difference in the physical setting (or abolish this difference by a theoretical rationale 
like the equivalence principle) then these tests have no general relativistic content and hence they cannot 
be supporting evidence to general relativity as such (as indicated earlier). 

Anyway, even if we interpret the evidence of special relativity as evidence for the validity of special 
relativity locally in the spacetime of general relativity (as required by the equivalence principle) and 
hence it is evidence for general relativity as a “General Theory”, the generality of this cannot be established 
(noting that without this generality the evidence is useless). In fact, the claimed evidence against the 
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strong equivalence principle (or at least against its generality) should invalidate this generality and hence 
the special relativistic evidence cannot endorse general relativity as a “General Theory” due to the lack of 
generality. 


9.14 Circumstantial Evidence 


In the literature of general relativity there are claims of many circumstantial evidence in support of 
general relativity such as the luminosity of quasars (which indicates accretion of matter onto black holes 
as a viable mechanism for energy release) or the distortion of some spectral lines in certain astronomical 
observations (which indicates the existence of black holes with certain characteristics). However, all these 
alleged circumstantial evidence and indications are neither specific to general relativity nor conclusive and 
definite in their value and significance and hence they do not provide any credible support to this theory. 


9.15 Evidence for Newtonian Gravity 


The evidence in support of Newtonian gravity in its classical domain is overwhelming and this may be 
seen as evidence for general relativity as well due to the convergence of general relativity to the Newtonian 
gravity in the classical limit (see § 5.1). However, this type of evidence is in fact an endorsement to the 
classical formulation (whether in the Newtonian gravity form or in the converged general relativity form 
which is a special case of the general formulation of general relativity) and hence it cannot be regarded 
as evidence for the general formulation of general relativity as represented by the Field Equation. In 
other words, the evidence in this special case is not a valid evidence for the general case (which is the 
important thing to establish since the special case of classical gravity is already established regardless of 
general relativity). In fact, what is significant for the endorsement of general relativity is the evidence 
for the general formulation which distinguishes general relativity from Newtonian gravity rather than the 
evidence for this special formulation because the classical formulation is sufficiently endorsed and hence it 
does not need more evidence. Yes, the convergence of general relativity to the Newtonian gravity (which 
is a well established theory considering its limitations and restrictions) in the classical limit shows that 
general relativity is compliant with the requirement of the correspondence principle (see § 1.8.3) and this 
is not the same as being an evidence in support of the theory of general relativity. 

We should also add two other factors that diminish the value of this type of evidence in endorsing 
general relativity: 
e The convergence of general relativity to the Newtonian gravity is not a naturally occurring convergence 
but it is a planned convergence since the formulation of general relativity is tuned to converge to the 
Newtonian gravity in the classical limit and hence this convergence has no value from the perspective 
of endorsing general relativity. In other words, general relativity is formulated in such a way that it 
converges to the Newtonian gravity in the classical limit (ie. it is forced to converge). In fact, this 
planned (or premeditated) convergence can be easily seen from the demonstration (in this book as well 
as in other books) of how general relativity converges to the Newtonian gravity in the classical limit, 
e.g. how « is defined to fulfill this convergence as seen in § 5.1. Yes, if the theory of general relativity 
was formulated without awareness or regard to the Newtonian gravity and it appeared later that general 
relativity converges to the Newtonian gravity then this convergence could be significant in endorsing 
general relativity (conditionally and in this case). 
e The formal arguments used to establish the convergence of general relativity (as represented by the 
Field Equation) to the Newtonian gravity in the classical limit are questionable in many details. In fact, 
these arguments (whether those given in this book, as presented in § 5.1, or their alike in the literature) 
which supposedly establish this convergence are based on many questionable assumptions, approximations, 
twisting, etc. and hence they can be challenged rather comprehensively and systematically.|?77| 


[277] Despite this, we did our best in this book to present the arguments and derivations of general relativity (whether related 
to the above convergence or to something else) in the most logical and sensible form (although they are usually assessed 
subsequently in a more realistic style that exposes their potential weaknesses). 
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9.16 Final Assessment 


The claimed evidence in support of general relativity can be classified into several main types (which are 

not mutually exclusive): 

1. Evidence that is not valid at all in its endorsement to general relativity. An example of this is the tests 
of the equivalence principle since these tests are related to the classical essence of this principle rather 
than to the general relativistic essence; moreover the evidence for the principle is not an evidence for 
the theory. Another example is the tests of special relativity since these tests have no endorsing value 
to general relativity as such. A third example is the evidence for Newtonian gravity as discussed in 
the last section. 

2. Evidence that the theory was initially tuned to match (i.e. the theory was formulated in a way that 
yields the anticipated prediction). The obvious representative of this type is the precession of the 
perihelion of Mercury. As this type of evidence does not represent an actual prediction of the theory 
its value is rather limited, i.e. all the evidence can imply is that the theory is consistent with some 
observations. In fact, even the evidence for Newtonian gravity may be classified into this type (if it is 
regraded as a valid evidence for general relativity in this special case). 

3. Evidence that is based on simple principles and arguments without the need for the full formulation 
of general relativity. An example of this type is the gravitational frequency shift or gravitational time 
dilation which can be obtained from the equivalence principle. The value of this type of evidence is also 
limited because it is not an evidence for the theory as a whole but it is an evidence for the principles (e.g. 
equivalence principle) which the evidence is based on. Although some of these principles may constitute 
essential components or foundations for the formalism of the theory the evidence is not conclusive in 
endorsing the theory itself. In fact, a survey of the literature of general relativity shows that a number 
of the consequences and predictions that form the basis of numerous alleged evidence are not necessarily 
based on the full formalism of general relativity, but they can be obtained from general and simple 
arguments (some of which are not related at all to the framework of general relativity). In this context, 
it is important to note that even from a historical perspective several of these consequences have been 
formulated and predicted (at least qualitatively) before the full formulation of general relativity and 
the emergence of the Field Equation. 

4. Evidence that is questionable in its significance and value since it can be explained by other theories. An 
example of this type is the evidence that can be explained and interpreted by alternative gravitational 
theories, e.g. retarded gravitational potential theory whose prediction about the precession of Mercury 
perihelion is similar to the prediction of general relativity. Also, many instances of gravitational 
frequency shift (and possibly even gravitational time dilation) fall in this category since they can be 
explained classically. In fact, we can include in this category almost all the qualitative evidence that are 
not specific to general relativity such as the existence of black holes since it can be predicted in principle 
by any valid gravitation theory including classical gravity because there should be no restriction in any 
valid and sensible gravitation theory on the existence of high concentrations of matter with certain 
gravitational properties (such as being “black”). This similarly applies to gravitational lensing and 
gravitational waves. In fact, if we include modern metric gravity theories (which emerged before and 
after general relativity) then almost all the predictions (whether qualitative or quantitative) of general 
relativity can be obtained from other theories and hence they are not specific and definite in their 
endorsement to general relativity. This is unlike, for example, quantum mechanics (and potentially 
even Lorentz mechanics) where no credible alternative theory can replace it and provide similar or 
identical predictions and explanations. 

5. Evidence that is based on trust. Such evidence cannot be accessed and assessed independently and 
hence its value is questionable because fraud, bias, mistakes and illusions are commonplace in science. 
In fact, they could be more common in science than in daily life due to the complexity of science and 
its special position and prestigious status. 

6. Evidence that is questionable in its authenticity as there are indications of falsification or exaggeration. 
An obvious example of this sort of evidence is the light deflection by the Sun in the 1919 solar eclipse 
expedition of Eddington and his team where the evidence was sexed up. It is wrong to believe that 
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10. 


11. 


this historical incident is a one-off event and it is a rare exception. In fact, there are other examples 
(or at least potential examples) of this type of alleged evidence (especially in the exotic experiments 
and observations) but we do not go through the details. As indicated earlier, fabrication, prejudice 
and exaggeration are as common (if not more common) in science as in daily life. 

Evidence that is based on simplifications and restricting assumptions in the formulation of general 
relativity and hence it is based on special conditions and compromising assumptions for the application 
of the theory. An example of this is some claimed astronomical observations whose implications are 
based on the use of a particular metric (e.g. Schwarzschild) or a conditional solution or a special 
case. Such evidence cannot be used to endorse the theory in its entirety and generality. We may also 
include in this category the evidence from certain gravitational regimes (which are mostly Newtonian or 
quasi-Newtonian) since the value of such evidence is rather limited with regard to the general validity 
and applicability of general relativity and its distinctive features against other theories. We may also 
include in this category the evidence from the linearized form of general relativity (e.g. the common 
formulation of gravitational waves) since the evidence on the validity of the linearized general relativity 
is not sufficient to establish the non-linearized general relativity (which is the real general relativity 
that critically needs evidence). 

Evidence that is not conclusive due to many uncertainties and large error margins. Examples of this 
type of evidence include most light bending observations as well as at least some claimed observations 
of gravitational waves and frame dragging. 

Evidence that is not independent of other types of evidence such as gravitational frequency shift which 
is correlated to gravitational time dilation. Such evidence has no value as independent evidence since 
it targets the same or similar aspect of the theory. 

We should also note that almost all the claimed evidence in support of general relativity are related 
to the theory as a gravity theory and not as a “General Theory”. Therefore, even if all these evidence 
are valid and conclusive they just establish the theory (or rather part of it) as a gravity theory and 
not as a “General Theory”. 

The overwhelming majority of the claimed evidence is based on the Schwarzschild metric and this 
should diminish the value of the evidence due to the Schwarzschild restriction. For example, the 
obtained results could be specific to this metric. Moreover, if this metric (or its equivalent) is obtained 
from another theory or argument then the evidence will not be conclusive in its endorsement to general 
relativity specifically. 


To sum up, the claim that the theory of general relativity is supported by overwhelming and conclusive 
scientific evidence or it is proved beyond any doubt should be rejected. All the claimed evidence (even with 
the exclusion of the possibility of fabrication, exaggeration, error, etc.) do not provide full endorsement 
to the theory in its entirety and over its whole domain of validity. In fact, we may even claim that there is 
evidence against at least certain aspects and implications of general relativity such as the validity of the 
strong equivalence principle (and hence the validity of the “General Theory”). In brief, if we compare the 
evidence in support of general relativity to the evidence in support of other respected theories of modern 
physics, such as quantum mechanics and Lorentz mechanics, we can conclude that the evidence for general 
relativity is feeble and limited. 


Problems 


1; 


What is the strongest evidence in support of general relativity? Assess its significance. 

Answer: Such a question has no definite answer since it depends on personal views and many ques- 
tionable and controversial factors. However, in our view the strongest evidence in support of general 
relativity is seemingly the perihelion precession of Mercury. This may also apply to other claimed 
evidence of precession in orbital motion, e.g. the evidence of periastron precession in some binary 
pulsar systems. 

As we saw, this evidence can also be explained by other theories (e.g. retarded gravitational potential 
as well as other metric gravity theories) and hence it is not definite in its endorsement to general 
relativity specifically. Moreover, the theory of general relativity was tailored to match this prediction 
and hence the value of this evidence should be diminished by this factor (although this may apply only 
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to the precession of Mercury). We should also mention that there are technical challenges to the theo- 
retical prediction of general relativity where the effect of the curvature of spacetime is partly ignored 
in the formulation and calculation (e.g. by ignoring the difference between physical and coordinate 
variables which should be important due to the violation of locality and accumulation of tiny effects; 
see § 8.1 and § 9.1). So in brief, even this “well established” evidence is not really well established as 
claimed. 


Exercises 


1: 
2. 
3. 


4. 


Discuss the main types of evidence of general relativity and assess their value and significance. 

List some general factors that could diminish the value of the evidence in support of general relativity. 
Our assertion that the predictions that are not specific to general relativity (by being classically 
explicable for instance) have limited endorsing value may be challenged. Discuss this issue. 

Why factors like fraud and error should be considered more seriously in assessing the evidence in 
support of general relativity? 


9.17 Evidence against General Relativity 


The investigation in the previous sections of this chapter was largely about the evidence in support 
of general relativity. Regarding the opposite evidence, we believe there are evidence against general 
relativity especially in its “General Theory” side. For example, the indications about violations of the 
strong equivalence principle and its general validity or the indications about violations of the relativistic 
paradigm of speed limit (ie. c is the specific speed for massless objects and it is the ultimate speed for 
any physical object) should undermine general relativity and put serious question marks on its general 
validity. Some of these evidence and indications against general relativity will be investigated in the next 
chapter. 


Chapter 10 
Challenges and Assessment 


In this chapter we investigate a number of challenges and criticisms to the theory of general relativity. We 
will also try to assess general relativity from a general perspective. In fact, we discussed throughout the 
book many detailed challenges and criticisms to general relativity and assessed the theory from different 
aspects and perspectives. However, most of those were specific and of technical nature. So, in this chapter 
we outline some general challenges and criticisms to this theory, its epistemology and methodology with 
an assessment of a general and non-technical nature (noting that the contents of this chapter are generally 
based on the contents of the previous chapters). 


10.1 Challenges and Criticisms 


There are many theoretical and practical challenges and criticisms to general relativity; most of which 
are by its opponents and some are even by its proponents. In the following subsections we list and briefly 
investigate a number of examples of these challenges and criticisms (noting that some of these challenges 
and criticisms are specific to this book and hence they represent the view of the author specifically). 


10.1.1 Limitations and Failures of the Equivalence Principle 


Because the equivalence principle is at the heart of general relativity and one of its most fundamental 
propositions, any limitation or failure related to this principle is a major blow to the theory and its 
theoretical and epistemological framework. In this regard we note the following: 

e There are fundamental theoretical challenges to the equivalence principle. In fact, the equivalence prin- 
ciple is challenged and rejected even by some of the followers of general relativity (see for example Synge 
in the References) due to the lack of rationality as well as physical evidence. For example, the equivalence 
principle (at least in its strong form) can be challenged by the possibility of the dependence of physical 
laws on the existence (and non-existence) of gravitational field gradients and tidal forces (regardless of 
being detectable by our observation or not). Moreover, even if we assume that the apparently-intuitive 
theoretical arguments in support of the equivalence principle can establish the weak form, they cannot 
establish the strong form. In fact, using the locality as a basis for having good quantitative approximation 
may be valid with respect to the weak form but it cannot be used to establish the strong form since the 
existence (and non-existence) of gravitational field gradients and tidal forces may result in a qualitative 
difference between gravity and acceleration and this qualitative difference may invalidate the strong form 
altogether, i.e. the physical system in which field gradients and tidal forces exist can be totally different 
in its behavior from a similar physical system in which no field gradients and tidal forces exist.!?7°| 

e There is no real evidence for the validity of the equivalence principle (neither the weak nor the strong). 
This is because all the claimed evidence for the weak equivalence principle are actually for the classical 
equivalence principle with no evidence that can be claimed to support the theoretical content of the weak 
equivalence principle (see § 9.12). The lack of evidence for the strong equivalence principle is more obvious. 
As we indicated earlier, there is no evidence in support of the strong equivalence principle even according 
to the view of some general relativists. In fact, we may even claim (with some general relativists) that 
there is evidence against the strong equivalence principle (some examples will be given later)./?791 


[278] Ty fact, the difference between gravity and acceleration (in the context of the equivalence principle in general) could be 
in their nature and hence it is more fundamental than by being having or not having field gradients and tidal forces. 

[2791 In this context, it is useful to be aware of the opinion of some general relativists about the equivalence principle as 
expressed in the following quote (which is taken, with some modifications, from lecture notes by Sean Carroll): “the 
principle of equivalence serves as a useful guideline but it is not a fundamental principle of nature and hence this 
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e Apart from the above physical challenges to the equivalence principle (whether weak or strong), we may 
also mention some epistemological challenges to this principle. Some of these challenges are discussed in 
the Exercises. 

e There are several sources of limitation and ambiguity in the equivalence principle. For example, to what 
type of gravitation rotating and decelerating frames correspond? And what role should be attributed in 
this context to absolute frame (see for example § 6.5)? 

e The huge mess in the literature of general relativity about the equivalence principle (definition, classifica- 
tion, implication, interpretation, validation, etc.) discredits many of the claimed aspects and significance 
of this principle and degrades its integrity and credibility. In fact, the equivalence principle is one of the 
best examples in modern physics for controversies, contradictions, arbitrariness, and chaos. Most of these 
differences are based on lack of clear vision and consistent logic in dealing with this thorny issue where 
inherent ambiguities in the theory itself and numerous complexities in its theoretical framework allow 
many to dictate their personal views and validate their own results by alleged tests and verifications many 
of which are scientifically baseless. 

e The locality condition should mean (and it is actually interpreted by some relativists as such) that 
the domain of the physical experiments should be restricted to the “local” region of the space, i.e. the 
experiments must be isolated from any influence or effect or consequence related to objects and fields 
outside that region. This restriction should further degrade the value and significance of the equivalence 
principle because if we remove this artificial restriction then the outcome of an experiment in a gravitational 
field could be totally different from its outcome in an equivalent accelerating frame. For example, an 
electric charge at rest in a gravitational field does not radiate while an identical electric charge in an 
equivalent accelerating frame should radiate.!?8°! In fact, examples like this (and there are many) should 
invalidate the strong equivalence principle altogether (or at least some of its implications and significance). 
e As indicated earlier, it is quite possible that there is a qualitative difference between gravitation and 
acceleration due for example to the existence of field gradients and tidal forces in the former but not the 
latter.?84] This should invalidate any argument in support of the equivalence principle as a practically 
useful guiding rule (as some general relativists claim despite their doubts about the principle as a rigorous 
rule). 

e As indicated in a previous point and can be understood from previous and upcoming examples, the 
equivalence principle is a local rule. This does not only mean that the equivalence principle is a local rule 
of limited validity but it also means that even if we extend the locality of the rule slightly it could fail 
(i.e. it could fail even in a sufficiently extended local neighborhood). The implication of this “confined” or 
“restricted” locality is that this alleged principle is an entirely artificial principle and hence it has no real 
physical value or significance. In other words, to make this principle valid and functioning we should create 
or determine or define our physical system in such a way that makes this principle valid and functioning 
(and this is really ridiculous). 


Exercises 


1. Discuss the failure of the alleged equivalence between inertial and gravitational mass in achieving this 
equivalence. |?82| 

2. According to the equivalence principle of general relativity, a stationary observer relative to a source 
of gravity (e.g. a terrestrial lab) is equivalent to an accelerating observer. Analyze and discuss this 
proposition (which should lead to another failure of the equivalence principle). 

3. As part of the extension of special relativity to include the domain of general relativity, an accelerating 
frame should be represented by a series of instantaneous inertial rest frames (IIRF) where special 


principle is not rigorously true”. 

[280] We should note that even the locality restriction may not be able to address this challenge. 

[281] As indicated, the existence of field gradients and tidal forces is just a possibility and example; otherwise the difference 
between gravitation and acceleration could be more fundamental and hence it does not need even the existence of field 
gradients and tidal forces (or any other reason) for its existence. 

[282] As discussed earlier (see § 1.8.2), the alleged duality of inertial mass and gravitational mass is a general relativistic issue 
and not a classical issue since in classical physics mass is mass whether it is in Newton’s second law or in Newton’s 
gravity law. Accordingly, this exercise is about a challenge to general relativity specifically. 
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relativity applies in these frames. Discuss and assess this. 

4. Analyze and assess the local application of special relativity in the spacetime of general relativity 
according to the strong equivalence principle. 

5. Give an example of a limitation of the equivalence principle related to the local application of special 
relativity in the curved spacetime of general relativity. 


10.1.2 Necessity of Metaphysical Elements 


The necessity of hypothesizing the existence of dark matter and dark energy to explain certain physical 
formulations and observations should be considered as one of the failures of general relativity. In fact, the 
requirement of dark matter and dark energy is a failure not only to general relativity but to any theory 
that requires such dark objects in its theoretical framework or in its practical applications. Generally, dark 
objects are metaphysical objects even if they are incorporated in an ostensible physical theory. We should 
note that although dark matter and dark energy are not built in the basic theoretical structure of general 
relativity,!?84] they are strongly associated with it and its interpretations and applications (especially in 
cosmology) and for this reason general relativity can be criticized for its potential and aptitude to allow 
and accommodate such non-scientific concepts and entities. We should also note that dark matter may be 
legitimate in some cases where there is an indirect evidence for the existence of such matter independent 
of the theoretical necessity. In fact, we briefly discussed this issue earlier (see for example exercise 4 of § 
3.3). 

We may also include (by extending the meaning of metaphysics) the incorporation of non-physical 
concepts in physical theories such as the concept of singularity (e.g. of black holes and Big Bang) or the 
creation of spacetime in no-time and no-where or the expansion of the Universe (during and following the 
Big Bang) with no center, no-time and no-where. Although these concepts and ideas may be legitimate and 
meaningful in mathematics or philosophy or religion they are not legitimate in physical sciences because 
they have no scientifically verifiable meaning or content (or rather they have no scientific meaning at all). 
In fact, most of these non-physical elements originate directly or indirectly from fundamental principles 
and paradigms in the theoretical framework and epistemology of general relativity such as merging and 
incorporating spacetime into the fabric of the physical phenomena (or the other way around) as expressed 
by the principle of metric gravity (see § 1.8.4) instead of treating spacetime as a container in which physical 
phenomena take place. 


Exercises 


1. Outline the problem of dark matter. 

2. Argue against the existence of dark matter. 

3. It may be claimed that the existence of dark energy can be justified by the “observed” expansion of 
the Universe and hence dark energy has physical justification and evidence. How do you address this 
claim? 


10.1.3. Creation Theory 


It may be claimed that general relativity is a creation theory (and hence it contains non-scientific elements) 
due for instance to the fact that it does not conserve energy. As indicated earlier (see § 1.9), a creation 
theory cannot be totally scientific theory because creation requires supernatural elements and agents and 
hence it belongs to religion or philosophy, for example, rather than science.|?841 However, the failure of 
energy conservation at global scale does not require creation if it is related to the relativity of time and 
frequency for instance.!?85! Yes, the creation of dark energy is required in some relativistic cosmological 


283] This should be more true with respect to dark matter because dark energy is related to the cosmological constant which 
is built (although not necessarily) in the formulation of general relativity as represented by the Field Equation (see § 
3.3). 

284] We note that “creation” in this context is more general than “creation” and “annihilation” in their non-scientific impli- 
cation and significance. 

285] As discussed earlier (see for example § 7.9), since time is relative the principle of conservation of energy should lose its 
significance at global level. This is because the relativity of time means there is no global significance to time or any 
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models and this could be considered as an implication of general relativity and hence general relativity 
can be classified on this basis as a creation theory because it (at least) allows creation. There are other 
creation aspects in the applications and interpretations of general relativity (rather than in the theory 
itself) such as the creation of spacetime in the Big Bang according to relativistic cosmology (or some of 
its accepted premises) and hence general relativity should take the responsibility for accommodating such 
creation aspects in its applications even if we assume that general relativity itself is not a creation theory. 
In fact, the entire Big Bang theory (which is a fundamental pillar in relativistic cosmology or at least it 
is strongly associated with it) is effectively a creation theory (at least because it implies creation of space 
and time although the Big Bang itself may not be necessarily so if we follow a certain interpretation which 
may be different to what is commonly accepted). 


Exercises 


1. Discuss the issue of conservation of energy in general relativity and its applications. 

2. The necessity of the conservation of energy principle in physical theories may be challenged. What is 
your argument against this challenge? 

3. Assess the assertion that the breakdown of the conservation of energy at global level in general relativity 
can be seen as a logical consequence of the breakdown of global time since time in the spacetime of 
general relativity is well defined only locally. 


10.1.4 Dependence on Special Relativity 


It may be claimed that the dependence of general relativity on special relativity makes general relativity 
susceptible to questioning if special relativity (or Lorentz mechanics) proved to be wrong or incomplete. 
So, the challenges to special relativity may also be seen as challenges to general relativity because special 
relativity is supposed to be at the foundation of general relativity. This can also be justified by the local 
application of special relativity in the spacetime of general relativity. However, this may not affect general 
relativity as a gravity theory because the gravitational formulation of general relativity (as represented 
by the Field Equation and its consequences) is not based on the formulation of special relativity (as 
represented by the Lorentz transformations and their consequences). In fact, there is no direct or formal 
link between the two formulations (see § 1.7 and § 9.13 and refer to B4). Moreover, the local application 
of special relativity in the spacetime of general relativity may not be entirely equivalent to the global 
application of special relativity in the Minkowski spacetime, and hence the failure of special relativity (as 
such) may not necessarily imply the failure of its local application in the spacetime of general relativity. 
However, this should depend on the nature and cause of the failure and hence the failure of special 
relativity could lead to the failure of its local application in the spacetime of general relativity and to the 
collapse of the “General Theory”. So overall, the failure of special relativity should put a question mark on 
the validity of general relativity because special relativity (in one sense or another) is at the foundation 
of general relativity and is strongly related to it.|?°4 


Exercises 


1. Give an example of the challenges to special relativity that potentially threaten general relativity. 


10.1.5 Triviality of General Invariance 


The requirement of invariance of physical laws (i.e. the necessity of expressing the physical laws in tensorial 
form) which is considered as one of the great achievements of general relativity is just a trivial claim or 
convention with no actual physical substance or implementation. In this regard, we note the following 
about the requirement of invariance (or general covariance) in general relativity: 

e As we discussed earlier (see § 1.8.1) the principle of invariance is no more than an epistemological 
convention or demand with minor scientific content and significance. 


physical quantity that depends on time and determines energy such as frequency. 

[286] We should also distinguish in this context between the failure of special relativity (as an epistemological theory of mainly 
interpretative significance) and the failure of Lorentz mechanics (as a scientific theory based on the formalism of Lorentz 
transformations). 
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e Even if we accept the physical necessity (rather than the epistemological necessity) of the invariance 
of physical laws it should mean the necessity of the invariance of the essence and actual content of the 
physical laws and not the necessity of formulating the laws in a particular form (e.g. tensorial form) 
although such a formulation should be beneficial to the objectives of invariance. 

e There is no actual implementation of this proclaimed general invariance in the theory of general relativity 
because this alleged invariance is not more than a demand. Also, the validity of this general invariance is 
not shown in the theory or proved by it in detail and as applied to actual laws (as done for example in 
Lorentz mechanics). 

e The tensors in special relativity are Lorentz invariant, i.e. they are “Lorentzian tensors” and not general 
tensors. The “General Theory” is based on generalizing the formalism of special relativity to apply locally 
in curved spacetimes and this requires the existence of valid general tensorial (covariant) transformations 
which is not guaranteed (in its generality as a minimum). At least, this existence requires demonstration 
and verification. 


Exercises 


1. The principle of invariance is no more than an epistemological convention with minor scientific content 
and significance. Discuss this proposition. 

2. Distinguish between the invariance of laws in essence and the invariance of laws in form. 

3. Clarify the assertion about the triviality of the alleged general invariance (or general covariance) in 
general relativity. 

4, Justify the triviality of the Principle of General Covariance (see Problems of § 1.8.2). 


10.1.6 Interpretation of Coordinates 


As we saw earlier (refer for example to § 6.2), the interpretation of coordinate variables as labels with no 
metrical significance is not intuitive. In fact, this may be intuitive in abstract mathematics and geometry 
but not in physics. Moreover, this leads to practical problems in application where data representing 
physical systems or extracted from experiment and observation are metrical and hence they cannot be 
used in the formulae whose variables are supposed to be coordinate symbols of mere labeling value. In 
fact, the problem of the interpretation of coordinate variables in general relativity extends to many other 
physical quantities and principles (which depend on coordinates in their definition and determination 
such as energy and energy conservation) where they lose their physical significance or require special or 
tentative interpretation (refer to § 7). This does not only complicate and hinder the interpretation and 
understanding of the physics but it also introduces traps, hurdles and complexities in the application of 
the formalism (see Exercises). The reader should finally note that “interpretation of coordinates” in this 
criticism does not only mean attaching a conceptual meaning to coordinates but it also means treating 
them in reality as such (i.e. stripping off their physical and metrical significance in the formalism and 
implementation). 


Exercises 


1. What is the root of the idea that coordinates are labels with no metrical value? 

2. List some of the problems caused by the interpretation of coordinate variables as labels with no metrical 
significance (which originates from employing the paradigm of curved spacetime). 

3. It may be claimed that “As long as the coordinates are not measurable quantities then the actually- 
measurable physical quantities cannot be correlated in reality to these coordinates. In other words, 
the actual link between coordinate quantities and physical quantities is missing in reality because this 
link is theoretical rather than practical”. Discuss this claim. 


10.1.7 Strong Dependency of Physical Results on Coordinate System 


A distinctive feature of general relativity (due to its highly mathematical nature and its geometric ap- 
proach) is the strong dependency of the obtained physical results and consequences on the employed 
coordinate system (as can be easily seen in the difference between the predictions based on the use of 
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Schwarzschild coordinates and those based on the use of other coordinates such as Kruskal-Szekeres co- 
ordinates) where this strong dependency can sometimes lead to contradictions and absurdities.!?87] In 
fact, even if we accept the standard defence of general relativists of blaming the coordinates for obtaining 
conflicting and absurd results, we cannot trust such a theory that can abuse the coordinates and make 
them dysfunctional to such extent. This is because we cannot be sure if the results that look rational 
and are not in conflict with other results obtained from other coordinates are reliable since they can also 
be faulty mathematical artifacts of the employed coordinates even though we are unable to detect this 
by comparing these results to “more rational” results obtained from other coordinates (especially when 
results from other coordinates do not exist to compare with). 


Exercises 


1. Try to find a link between the issue in this subsection (i.e. strong dependency on coordinates) and the 
issue in the previous subsection (i.e. interpretation of coordinates). 

2. Give an example from the literature of general relativity about the strong dependency of the physical 
results on the type of coordinates used in formulating and solving physical problems. Also, discuss 
possible challenges and defence to this criticism. 


10.1.8 Local Relativity 


In special relativity we have “global relativity” of physical quantities represented by the difference of these 
quantities (such as time and length) across frames of reference where each one of these frames is global 
since it covers the entire spacetime. Thus, although we have for example length contraction and time 
dilation (which make length and time relative across different frames) these physical quantities are still 
global in each frame and hence there is no problem in the application of the formalism in each frame 
since the physical quantities are uniquely defined and determined over the entire frame and spacetime. In 
other words, although “relativity” introduces some ambiguity on the physical quantities (since they differ 
between frames) the global nature of this relativity (i.e. being across global frames) makes this ambiguity 
harmless (at least in most cases) because what is largely needed is the application of the formalism in each 
individual frame (and the results can be transformed unambiguously across frames when this is needed). 
This means that physical quantities in the spacetime of special relativity are actually absolute as far as 
each frame in concerned since they are defined globally over the entire (global) frame and spacetime. 
For example, if we have a physical formula that involves length then when we apply this formula in a 
given frame the length is defined and determined uniquely and unambiguously over the entire frame and 
spacetime and hence we have no problem if this length extends over different patches of the spacetime (or 
even over the entire spacetime) because the length in any patch is the same as the length in any other 
patch. Similarly, we have no problem if the length in one patch of spacetime is correlated to the length 
(or length-related quantity) in another patch because length is uniquely defined and determined over the 
entire spacetime. 

The situation in general relativity is quite different (thanks to the spacetime curvature and the non- 
metrical significance of coordinates which affects the significance of physical quantities and makes them 
dependent on the local metric properties) because what we have in general relativity is local frames 
where each one of these frames covers only certain patch of the spacetime (and hence the relativity in 
general relativity can be described as “local relativity” since it occurs across local frames coordinating 
limited patches of the spacetime). Now, due to the curvature of spacetime and the metrical difference 
between different patches, physical quantities like time and length have no global significance since they 
are relative across the different patches of spacetime and hence they cannot be defined and determined 
uniquely across these patches. So, when we have a physical formula that involves length for example 
and this length extends over different patches (e.g. distance between galaxies) and possibly across the 
entire spacetime then there is an inherent ambiguity in the application of this formula due to the inherent 


[287] This may be exemplified by the previously-discussed issue of the time of fall of an object to the singularity or event 
horizon of a black hole (as seen by an observer at infinity) where this time can be finite or infinite depending on 
the employed coordinates. It should be obvious that we are talking above about the Schwarzschild solution in these 
coordinates (i.e. Schwarzschild and Kruskal-Szekeres). 
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ambiguity of global length in such curved spacetime. In fact, this could be one reason why in general 
relativity we have different definitions of physical quantities such as ruler distance and radar distance. This 
ambiguity should affect the uniqueness of any obtained result as well as its interpretation and physical 
significance. It should also introduce practical difficulties in applying the general relativistic formulae 
that involve this type of physical quantities (e.g. how to obtain the physical radius of the Earth or the 
Sun when needed in a formula where integral techniques for example are not feasible or sensible since 
integration in such situations means sum of quantities defined and determined differently over patches of 
different metrical properties). 

In this context, we should remark that integration (which is based on the notion of “sum”) in curved space 
loses its ordinary meaning (as understood in flat space) since physical quantities (whether scalars or non- 
scalars) at different locations cannot be added as in flat space because they are measured and metricized 
differently. In fact, this should explain why physical laws in general relativity are generally differential 
(rather than integral) relations. It should be obvious that associating (such as adding, subtracting, inner 
multiplying, equating, correlating by > or <, etc.) physical quantities (whether scalar or non-scalar) at 
different locations is strictly sensible only in flat space where there is a global and universal reference 
for defining, metricizing and calibrating these quantities. Accordingly, associating physical quantities at 
different locations in curved space is actually a relaxed or sloppy form of association in which paradigms 
like tangent space, parallel transport and local standards of calibration are used. 


Exercises 


1. Can you give an example of a physical quantity in the spacetime of general relativity that is absolute 
and global? What is the significance and implication of this? 


10.1.9 Nonsensical Consequences and Predictions 


A brief inspection of the literature of modern physics in the last few decades reveals that general relativity 
is a major source and fertile ground for growing fantasies and illusions and breeding bizarre ideas such as 
wormholes, time machines and dark energy. The methodology of general relativity has also contributed to 
the incubation of more ridiculous theories such as the string theory in its various forms. In fact, general 
relativity should be given the main credit for transforming physics from its rational classical methodology 
to its irrational modern methodology. This should point out to a fundamental defect in this theory that 
leads to such disastrous consequences (or at least there should be something defective in its methodology, 
philosophy and epistemology). 

Exercises 


1. Give some examples of some nonsensical general relativistic results that violate certain physical rules 
and concepts. 

2. It may be claimed that general relativity is not different from classical physics in having nonsensical 
consequences and predictions because even in classical physics we may obtain nonsensical results (in 
the form of non-physical solutions such as imaginary roots) from legitimate classical formulations. 
Address this challenge. 


10.1.10 Incompatibility with Quantum Mechanics 


It is well known that general relativity and quantum mechanics are not compatible with each other and 
this incompatibility hindered the development so far of a comprehensive and viable theory of quantum 
gravity. This is one example of the many inconsistencies in modern physics that justify a call for revision 
and change. The least that can be said about this incompatibility is that even if both theories are 
fundamentally correct and valid in their domain of application, there could be an intrinsic incompatibility 
in their formalism and methodology?®*! that prevents merging them into a single consistent theory, and 
this is not a trivial defect that can be fixed by just introducing minor or ad hoc changes here and there. 


[288] 4 likely example for incompatibility is the use of curved spacetime in general relativity and flat space-time (or spacetime) 
in quantum mechanics (and its derivatives). 
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Regardless of what is right and what is wrong (and regardless of even the existence of right and wrong), 
it seems necessary to discard at least one of these theories to sustain the natural development and healthy 
progress of modern physics. In our view, if we have to keep one of these theories we should keep quantum 
mechanics because it is the theory that is well supported by experiment, or at least it is the one that is more 
supported by experiment. A brief look around us and an abrupt inspection to our daily life will reveal that 
quantum physics is everywhere while general relativity is almost no where (at least because all the alleged 
practical applications of general relativity as a gravitational theory can be equally explained by classical 
gravity while the alleged non-gravitational applications are connected directly to Lorentz mechanics which 
is an independent theory).|?8°! In fact, a broad familiarity with general relativity is sufficient to reveal 
that this theory is largely a sort of intellectual luxury that is enjoyed by an elite of theoretical physicists, 
mathematicians, philosophers and cosmologists most of whom are completely detached from reality. This 
should be enough to make our choice and move forward. 

However, we should remark that this does not mean that quantum mechanics is a perfect theory. In 
fact, there are many epistemological problems and difficulties in quantum mechanics. So, despite its 
huge empirical success quantum mechanics is not an ideal theory. In other words, quantum mechanics 
may provide very reliable rules from a practical perspective, but it is not ideal from a theoretical and 
epistemological perspective. Anyway, quantum mechanics is not the subject of the present book and hence 
we have no reason to discuss these issues. 

As we noted earlier, quantum mechanics (and all its derivatives such as quantum field theory) does 
not need the paradigm of curved spacetime and hence it is in harmony from this perspective with other 
branches of physics and this should make any potential unification of quantum mechanics with other 
theories viable and consistent from this perspective. This is unlike general relativity which is based on 
the paradigm of curved spacetime and this paradigm (which distinguishes general relativity from other 
major physical theories) can be a potential cause for hindering the unification. We should also note that 
we have reservations about the current approaches about quantum gravity and the unification of physical 
theories and the nature of debates and suggested methods in dealing with quantum gravity in modern 
physics. However, these issues are not within the scope of this book. 


10.1.11 Gaps, Ambiguities and Question Marks 


General relativity contains gaps, ambiguities and questionable issues in its theoretical structure and hence 
the position of the theory about many important issues is problematic or not clear cut. For example: 

e What is the position of the theory when the acceleration is not uniform (i.e. what rules should apply, e.g. 
Lorentz rules or something else) and to which gravity this situation corresponds? This mainly originates 
from the many restrictions and conditions that are embedded within the equivalence principle and this 
should propagate to the formalism of the theory and its foundation. 

e What is the position of the theory about decelerating and rotating frames and to which gravity they 
should correspond according to the equivalence principle and hence what physical laws should apply in 
these frames (see exercises 23 and 24 of § 1.8.2)? In fact, this should also be linked to the more fundamental 
issue of absolute frame (which is denied by general relativity or at least by general relativists) where 
the theory is required to address this issue and all its consequences (such as the effect of the existence 
of absolute frame on the equivalence principle; see § 6.5) properly and seriously. To the best of our 
knowledge, there is no proper and profound investigation in the literature of general relativity to these 
key issues (let alone viable solutions). 

e What is the position of the theory when the gravitated object is not a test particle and hence it modifies 
the gravitational field of the gravitating object substantially? This arises from the fact that in this case the 
distribution of matter depends on the curvature of spacetime while the curvature of spacetime depends on 
the distribution of matter and this is a form of circularity (rather than a form of non-linearity as presented 
in the literature although it could lead to non-linearity). Accordingly, in highly complex physical situations 


[289] Even if global positioning system is really an application of general relativity (considering the many problematic issues 
about this as discussed in § 9.3.4), it cannot be compared (neither qualitatively nor quantitatively) to the applications 
of quantum mechanics which enter in every detail of our daily life in a spectacular manner. 
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(where the distribution of matter is complicated and where gravitating and gravitated objects cannot be 
defined and distinguished from each other) the application of the theory and its strategy is problematic 
(at least because the paradigm of geodesics is not feasible). This means that sensible application of the 
theory is limited to rather simple physical situations and cases (where test particles follow well defined 
geodesics of a curved spacetime created by a gravitating object or source of gravity). It is noteworthy 
that in the literature of general relativity there are some proposed fixes to this problem but they are not 
convincing or general; moreover, in most cases they are not practical. 

e What is the position of the theory when the gravitational frame is not in a state of free fall and hence 
special relativity should not apply even locally (see for example exercise 3 of § 6.1 and the Problems of 
that section)?!?9°l Some general relativists suggest that fictitious forces should be assumed in such a frame 
in order to account for the effects of gravity (and hence we may still be in need for the classical paradigm 
of gravitational force although it is classified as fictitious).?9!! Other general relativists seem to suggest 
that a law that holds true in the absence of gravity (i.e. it is a valid special relativistic law) should also 
hold true in the presence of gravity. But this on its face (apart from the lack of observational evidence in 
its support and hence it is a baseless claim) is inconsistent with the equivalence principle where special 
relativity should hold (locally) only in freely falling frames (because otherwise the effect of gravitational 
acceleration should be null). A more rational suggestion (which may be an elaborate version or clarification 
of the latter suggestion) claims that to account for the effect of gravity in stationary gravitational frames 
(or non freely falling gravitational frames to be more general) we should first apply special relativity in the 
local (freely falling) inertial frame and then we transform the laws to the stationary (or non freely falling) 
frame by a general transformation where this transformation is justified by the equivalence principle. In 
fact, this is inline with the Principle of General Covariance (see Problem 3 of § 1.8.2) and it seems to be 
the most logical and consistent stand (within the framework of general relativity) in addressing the issue 
of stationary (and non freely falling) gravitational frames. However, apart from the dependency of the 
validity of this on the validity of the strong equivalence principle which is questionable (due to the lack 
of evidence in its support or even the existence of evidence against it), the logical foundations of this can 
be questioned and challenged. Moreover, ambiguities and question marks can still be found in regard to 
this type of frames and their physical status. 

e Why some forms of energy (e.g. electromagnetic) can be a source of gravity while other forms of energy 
(e.g. gravitational waves) cannot (at least according to the view of some general relativists). Apparently, 
this originates from the bizarre treatment of gravity in general relativity as a curvature of spacetime 
rather than a physical phenomenon in spacetime and hence gravitational waves (which are supposed to be 
distortions of the fabric of spacetime) cannot be a source of gravity although they are essentially a form 
of energy since they can carry energy and momentum like electromagnetic waves. So, we simply deprive 
gravitational waves of the qualification of being a proper source of gravity only because we modeled them 
in a bizarre way and not because of a real physical necessity or requirement or an intrinsic difference from 
other forms of energy. In brief, we need clear and rigorous criteria, as well as justification, for what forms 
of energy can be a source of gravity and why being a source of gravity is restricted to these forms and why 
other forms are excluded. An issue that is closely related to the issue of which form of energy can be a 
source of gravity is the problem of the contribution of the binding gravitational energy to the gravitational 
mass of gravitating objects. This problem is similarly confused in the literature and requires clarification 
by general relativists. 


[290] Referring to exercise 8 of § 1.8.2 we may clarify this by saying: from a dynamic perspective freely falling frame in 
gravitational field is equivalent to inertial frame in free space and hence it is subject to the laws of special relativity, 
but from a static perspective stationary frame in gravitational field is equivalent to accelerating frame in free space 
and hence it is subject to what laws? Is it subject to the laws of special relativity in the instantaneous inertial rest 
frames? If so, then what is the evidence for this (noting that there is in fact evidence against it since the physics in an 
accelerating frame is not necessarily the same as the physics in the series of its instantaneous inertial rest frames; see 
for example exercise 3 of 10.1.1)? 

[291] We should note in this context that Newton’s second law for example is not valid in gravitating frames (i.e. frames not 
in free fall) unless we assume fictitious forces (see § 6.1). We should also note that we may accept the gravitational 
component of general relativity but we may not accept the generalizations of the equivalence principle where Lorentz 
mechanics is extended to apply locally in all frames as if they are inertial (even if the conventional procedure of general 
covariance is followed). 
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e There are also many ambiguities, question marks and theoretical disparities about many details and 
technicalities like gravitational length contraction (refer to § 8.5 and § 9.5). This is also true with regard 
to the invariance of physical laws and the type of the required transformations between different frames as 
discussed earlier (see for example § 10.1.5). In brief, there are many ambiguities and uncertainties about 
these issues and their alike. 

e Whether energy conservation principle (as well as other similar conservation and non-conservation 
principles; see for example exercise 1 of § 10.1.3) is valid in general relativity or not and in what sense 
(e.g. local or global). There are different opinions and attitudes about this issue where all these opinions 
and attitudes claim to represent the stand of general relativity. We clarified our opinion (which is based 
essentially on analyzing the formalism, epistemology and applications of the theory itself) about this issue 
in several places (see for example § 7.9 and § 10.1.3), and we do not need to repeat. 


It is noteworthy that the above are just a few examples and thus there are many other examples of the 
lack of clarity (and even consistency) in the theory about important issues and this is a main source of 
controversy about the theory and its application and interpretation even among its followers (see § 10.2.5). 
In fact, there is no field in physics that contains such chaos of conflicting opinions and interpretations 
as the relativity theories (and general relativity in particular) even among the top experts in this field 
and this should reflect the lack of clarity in the theory itself (and indeed even lack of consistency since a 
theory that is capable of accommodating and accepting all these contradicting views and interpretations 
should be inconsistent or at least its consistency should be questionable). 


Exercises 


1. Discuss and assess the issue of whether the gravitational field itself contributes to the source of gravity 
(as represented primarily by the distribution of matter and energy) or not and whether gravitational 
forms of energy (such as gravitational waves) make such a contribution or not. 

2. According to the Principle of General Covariance the effect of gravity on a physical system can be 
assessed by the procedure described in the Problems of § 1.8.2 (also see exercise 3 of § 3.5). Accordingly, 
the problem of stationary local frame in a gravitational field (or non freely falling gravitational frame 
to be more general) is not a gap or ambiguity or questionable issue since the effect of gravity can be 
accounted for by this procedure. How do you respond to this? 

3. Discuss briefly the gaps (and potential gaps) in the investigation of the principles of invariance and 
equivalence (and their combination as represented by the Principle of General Covariance) in the 
literature of general relativity. 


10.1.12 Over-Mathematization of Physics 


General relativity might be criticized for its over-mathematization of physics. This should be obvious 
because general relativity geometrizes gravity which is a physical phenomenon. The danger of over- 
mathematization is that it naturally leads to illusions and fantasies since mathematics is full of artifacts 
and abstract objects that do not correspond to physical reality (e.g. we have imaginary solutions, singu- 
larities, infinities, senselessly-negative entities like mass, etc.). So, even if general relativity is essentially 
correct it could become dangerous from a theoretical perspective since it can lead to illusory beliefs 
about the physical world. In fact, we can see this in reality where general relativity generated and 
incubated many fantasies and pseudo-scientific theories thanks to its heavily mathematized approach. 
Over-mathematization also leads to complexities which consume precious resources and hinder progress 
and development. For example, solutions (whether analytical or numerical) are very difficult (if not 
impossible) to obtain except in very simple and rather trivial cases. 


10.1.13 Violation of Sacred Rules 


General relativity may be criticized for its lack of respect to fundamental physical and epistemological 
principles and its violation (or potential violation) of sacred rules such as the conservation principles 
and the rules of causality and physical reality. In fact, there are many examples of these violations (or 
potential violations); some of which were discussed earlier (see for example § 10.1.3) while others will be 
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discussed in the questions of this subsection as well as in the upcoming subsections. This could point 
out to a major defect in the theory and its theoretical framework because these rules and principles are 
generally well established (or supposedly so) and hence their violation based on a pure theoretical basis 
(e.g. as a demand of the theory of general relativity) should not be allowed. In other words, in the 
absence of experimental and observational evidence on their violation any theory that violates these rules 
and principles should be rejected (or at least questioned) because it is inconsistent with experiment and 
observation (at least so far and prior to the verification of the theory). In fact, even if we accept in 
principle the possibility of the violation of these rules and principles by new evidence that can emerge in 
the future, the theory should not be accepted unconditionally until such violation is verified (meanwhile 
the theory should be put on hold). 


Exercises 


1. Give an example of a general relativistic violation (or potential violation) of a sacred rule from special 
relativity, and assess in detail the significance of this violation and its impact on special relativity as 
well as on general relativity. 

2. Give another example of a violation (or potential violation) of sacred rules by general relativity. 

3. The call for formulating the physical theories (including gravity) using flat spacetime may be challenged 
by the possibility that the physical spacetime could be inherently curved. Discuss and assess this issue 
briefly. 


10.1.14 Circularity 


The methodology of general relativity is based on the philosophy that the state of matter (i.e. its dis- 
tribution and motion) determines the state of spacetime (i.e. its geometric properties) while the state 
of spacetime determines the state of matter. This is eloquently expressed by the famous idiom “matter 
tells spacetime how to curve, spacetime tells matter how to move”. In fact, this may be fine if this idiom 
(which really reflects the essence of general relativity as a gravity theory and its methodology) expresses 
the relation between a gravitating source of gravity and a gravitated test particle where the “gravitating 
matter” tells the spacetime how to curve while spacetime tells the “gravitated matter” how to move. But 
in more complicated situations where there is no distinction between gravitating and gravitated objects 
this methodology is problematic conceptually and practically since it is effectively circular. 

The circularity problem also extends (directly or indirectly) to the problem of “gravity of gravity”. This 
problem is reflected in the literature of general relativity within different contexts such as the issue of non- 
linearity and the energy density of gravitational field. Almost all (if not all) these problems and difficulties 
originate from the problematic central paradigm of general relativity that is based on distinguishing gravity 
from other physical phenomena by considering it a curvature attribute of spacetime rather than a physical 
phenomenon that takes place in spacetime. In this regard, we find in the literature complaints like “the 
task of assigning an energy density to a gravitational field is notoriously difficult” (see Hobson et al. in 
the References). What is really “difficult” (if not impossible) is the central paradigm of general relativity 
which leads to such dead ends and this paradigm should be revised or replaced if we have to avoid these 
notorious difficulties. 


10.1.15 Using Einstein Tensor to Represent Curvature 


The use of the Einstein tensor to represent and quantify the curvature of spacetime in the Field Equation is 
rather arbitrary!29?! and questionable because the rank-2 Einstein tensor contains rather limited curvature 
information since the full curvature information is contained in the rank-4 Riemann-Christoffel curvature 
tensor and the contraction of the latter tensor to obtain the Ricci curvature tensor and scalar (from 
which the Einstein tensor is formed) could remove essential curvature information. So, if gravity is 


[292] Ty fact, even the assumption that the spacetime of general relativity is Riemannian (or rather pseudo-Riemannian) is 
arbitrary. 
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really correlated to the curvature of spacetime (since it originates from this curvature) then this should 
compromise any gravity theory that uses the Einstein tensor to formulate gravitation.?%| 

For example, in a curved spacetime whose Riemann-Christoffel curvature tensor is not zero but its Ricci 
curvature tensor (and hence its Einstein tensor) is zero we have T),, = 0 (according to the Field Equation) 
and therefore the spacetime is curved without the presence of matter and energy. This should be a 
consequence of the fact that less curvature information are contained in the Ricci curvature tensor than 
in the Riemann-Christoffel curvature tensor and hence despite the curvature of the spacetime the Field 
Equation fails to represent the gravity of this missing part of the curvature since it predicts there is no 
gravity despite the presence of curvature (or it predicts gravity, represented by curvature, without energy- 
momentum source). In brief, gravity seems to be linked arbitrarily to a certain part of the spacetime 
curvature (in a certain way) and this requires justification.?94! 


Exercises 


1. Summarize the essence of the issue discussed in this subsection. 

2. What is the implication of using the Einstein tensor (rather than the Riemann-Christoffel curvature 
tensor) to describe gravity? 

3. Propose a challenge to the following statement: using the Einstein tensor to represent and describe 
gravity in general relativity is arbitrary and incomplete representation of gravity since the rank-2 
Einstein tensor captures only part of the spacetime curvature due to the fact that this curvature is 
fully represented by the rank-4 Riemann-Christoffel curvature tensor. 

4. A justification for the seemingly arbitrary choice of the Einstein tensor G“” to represent the curvature 
of spacetime is that it is the only rank-2 tensor that is second order in the derivatives of the metric 
for which the divergence vanishes (i.e. G“, = 0). Is this justification sufficient to address the issue of 
arbitrariness and partial representation of curvature? 

5. Comment on the following quote from Rindler (see References): “The choice of G',, for the LHS of 
the field equations may at first seem somewhat arbitrary. But it can be shown [cf. D. Lovelock, J. 
Math. Phys. 13, 874 (1972)| that G,,, is the only tensorial and divergence-free function of the g,,, and 
at most their first and second partial derivatives, that, when put on the LHS of the field equations, 
allows Minkowski space as a solution in the absence of sources”. 

6. Comment on the following quote from Synge (see References): “In Einstein’s theory, either there is a 
gravitational field or there is none, according as the Riemann tensor does not or does vanish. This is 
an absolute property; it has nothing to do with any observer’s world-line”. 

7. It may be claimed that although the curvature of spacetime is not totally represented in the Field 
Equation and even if the representation by the Einstein tensor is arbitrary, the main empirical criterion 
for any “correct” theory is its compliance with experiment and observation and hence as long as general 
relativity is supported by experimental and observational evidence this should not be a defect in the 
theory. How to address this? 


10.1.16 Ambiguity of Vacuum Equation 


As indicated earlier (see Exercises of § 3.2) the vacuum equation R,,, = 0 of general relativity cannot be 
solved unless we start from a guess about the metric. Hence, it cannot be solved by the natural (or ideal) 
method of obtaining the metric from the mass-energy distribution (as explained in § 4) and this should 
introduce serious difficulties in most cases. The reason is that the vacuum equation R,,, = 0 is local (since 
it is restricted to the vacuum region) and hence it does not include in itself the source of gravitation. 
Therefore, the effect of the source of gravitation should be incorporated indirectly through the metric 


[293] We may even argue that the use of the Riemannian geometry to describe the alleged gravity-related curvature of 
spacetime is arbitrary and it requires justification because even if we accept that gravity is a demonstration of spacetime 
curvature it is not necessary that this curvature is based on a Riemannian geometry since the geometry of spacetime 
can be non-Riemannian. 

[294] 1m our view, this issue should be linked to the Weyl (or conformal) tensor which is thoroughly investigated in the 
literature. We should also note that another potential source of arbitrariness in the formulation of curvature in general 
relativity is the common use of Christoffel connection rather than other types of connection. 
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which effectively is the only entity present in R,,. This can be revealed by a careful inspection of the 
solutions of the vacuum equation (e.g. Schwarzschild solution; see § 4.1.1) and how they are obtained 
where the source of gravity is usually inserted in the metric in a rather twitchy way (e.g. by comparison 
to the Newtonian solution in the classical limit). Such approach may apply in certain circumstances and 
cases but it is not general and this should represent a major difficulty in obtaining a solution in most 
cases. This should also aggravate the need for outside theories (e.g. classical gravity) and the dependence 
on external help and guidance to obtain solutions (refer for example to § 10.1.17). Moreover, it may even 
impose limits on the general validity of the obtained solution, e.g. by appealing to the classical limit. 

We should note that the criticism of ambiguity (and related complications) may also be directed to the 
vacuum problems in classical gravity (and indeed to any gravity theory) where the gravitational potential 
takes the role of the metric and hence this criticism is not specific to general relativity since it originates 
from the nature of the vacuum problems. However, the situation in classical gravity is much simpler (due 
for example to the scalar nature of the Poisson equation) and hence this is not a major handicap or hurdle 
for finding vacuum solutions. 

We should also remark that there is no guarantee that in the vacuum problems and solutions of general 
relativity (and indeed in any problem in general relativity; see § 10.1.15) full curvature information are 
accounted for and represented in the formulation due to the fact that in principle the rank-2 Einstein 
tensor (which represents the curvature of spacetime in the Field Equation) contains only part of the 
curvature information of spacetime which is contained fully in the rank-4 Riemann-Christoffel curvature 
tensor (see exercise 9 of § 2.13).1?95! This may be seen as another potential source of ambiguity in the 
vacuum problems and solutions of general relativity (and indeed in any problem in general relativity). 


10.1.17 Need for Classical Gravity 


An important thing to note about general relativity is its need to classical gravity in many situations 
and contexts and this should compromise the integrity and independence of the theory. In fact, this 
can be easily seen in many places in this book as well as in other books where the guidance of classical 
gravity is essential in analyzing the situation and formulating the physics. Moreover, in many practical 
situations related to real physical problems the proposed and employed general relativistic formulations 
are not purely general relativistic and hence they are actually hybrid classical-relativistic formulations 
(which should reflect the need of general relativity to external help). In fact, even the language and 
method of presentation of many relativistic ideas and paradigms require the help of classical gravity. For 
example, it is common in the literature of general relativity to use classical paradigms and terminology 
like gravitational field, tidal force, gravitational potential energy, etc. 

We may also add in this context the possible need of general relativity (or the need of some general 
relativists) for the classical concept of “gravitational force” in some situations, e.g. in the frames that are 
in a gravitational field but they are not in a state of free fall (i.e. frames of type c in the investigation of 
§ 6.1). However, this may be refuted by the allegedly fictitious (and hence presumably non-gravitational) 
nature of the force; moreover, it may be interpreted as an effect of acceleration according to the equivalence 
principle (although this requires physical substantiation). In fact, this may also be refuted more elegantly 
by the rationale of the Principle of General Covariance although this may not be sufficient to address all 
the difficulties that arise from the disposal of “gravitational force”. We should finally refer to the upcoming 
discussion of § 10.1.18 which may be regarded as another example of the persistent need for gravitational 
force. 


10.1.18 Failure of Geodesic to Replace Force 


The paradigm of “geodesic” is hailed in the literature of general relativity as an ingenious and innovative 
replacement of the old fashion paradigm of “force”. Apart from the fact that this is not such an ingenious 


[295] We are implicitly assuming that the curvature information in the Riemann-Christoffel curvature tensor can be classified 
into two parts: a part represented by the Einstein tensor and a part not represented by this tensor. Hence, if the latter 
part is void then the curvature information in the Riemann-Christoffel curvature tensor is fully represented by Einstein 
tensor; otherwise it is not. 
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and innovative idea (as explained earlier; see for example exercise 8 of § 2.9), there are many examples in 
which we find ourselves still in need for the paradigm of force to explain the physical situation satisfactorily. 
In fact, the paradigm of geodesic may explain the geometrical and kinematical aspects of the physical 
behavior of the gravitated object (i.e. its trajectory in spacetime) but it cannot explain the dynamical 
aspects of this behavior.29°! For example, let have a gravitated object that is released from rest in the 
“oravitational field” of a gravitating object. According to the old fashion paradigm the gravitated object 
is attracted by the force of gravity and hence it will start moving and accelerating toward the gravitating 
object along a certain trajectory that is defined by the gravitational field.297] However, the paradigm of 
geodesic cannot provide a full explanation of the movement of the object. To be more clear, even though 
the paradigm of geodesic may describe how the object moves along a certain trajectory (ie. determines 
the shape of trajectory in spacetime) it cannot explain why it should move at all (initially and along this 
trajectory) and follow this particular trajectory. In brief, while the paradigm of force is dynamic and 
lively (due to its physical nature) and hence it can provide a full physical explanation, the paradigm of 
geodesic is static and lethargic (due to its geometric nature) and hence it can provide only partial (and 
rather phenomenological) physical explanation (i.e. how but not why). In fact, this could have practical 
(and not only theoretical and conceptual) consequences.|?9°! 

Another demonstration of the failure of the geodesic paradigm to replace the force paradigm is the 
lack of symmetry in the concept of geodesic (where we have action of one mass, i.e. gravitating, and 
response by another mass, i.e. gravitated) in contrast to the concept of attractive force where each mass 
attracts the other mass as it should be (for obvious logical and physical reasons). The lack of symmetry 
in the concept of geodesic may invalidate Newton’s third law whose essence is the symmetry of interaction 
between physical objects which is a requirement for the equivalence between the objects in the absence 
of any discriminating factor (i.e. why should one object attract the other object but not the other way 
around?).!?99| The situation will be more obvious when the two objects are identical (i.e. have the same 
mass, the same shape, etc.) where the symmetry in the interaction (as expressed aptly by the concept of 
force but not by the concept of geodesic) is a necessity. In fact, this should also be linked to the problem 
that we discussed earlier (see the third point in § 10.1.11; also see § 10.1.14) in regard to the situation 
where the gravitated object is not a test particle (and hence its gravitational field is not negligible). 

In brief, “geodesic” is a geometric paradigm and hence when it is applied to a physical situation it may re- 
flect the physical phenomenon kinematically but not dynamically, while “force” is a full dynamic-kinematic 
paradigm and hence it fully reflects the physical phenomenon dynamically as well as kinematically. More- 
over, the paradigm of force is more general than the paradigm of geodesic in its application to various 
physical situations. So, the paradigm of force contains more physical content (i.e. the dynamics and 
kinematics and in all cases) of the physics of gravity than the paradigm of geodesic (which is restricted 
to the kinematics of gravity and only in some cases). 


Exercises 
1. Discuss the issue of geodesic deviation (see § 7.13) in the context of the failure of “geodesic” to replace 
“force”. 


2. Assess the paradigms of “geodesic” and “force” from the viewpoint of point-like and extended gravitated 
objects. 


296] We may also say: the paradigm of geodesic may provide a phenomenological explanation but not a fundamental or 


essential explanation. 

We should note that with the help of Newton’s second law the tempo-spatial trajectory (and not only the spatial 
trajectory) can be determined. 

We note that other relativistic paradigms (such as geodesic deviation) may aid the geodesic paradigm to overcome 
some difficulties and shortcomings. However, this cannot address all the difficulties and shortcomings in a general and 
fundamental way. In fact, even some of these aiding paradigms (such as geodesic deviation) are essentially about how 
rather than why. 

In fact, Newton’s third law could be seen as a demonstration of fundamental spacetime symmetries and hence it has a 
fundamental value from this perspective (regardless of the paradigm of force). 
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10.1.19 Limitation of Evidence 


As we saw earlier (refer to § 9), there are many limitations and restrictions on the claimed evidence in 
support of general relativity. For example: 

e There are specific limitations on the claimed evidence individually since almost all the claimed evidence 
can be challenged in their validity or value (in various ways) as explained in § 9. 

e There are limitations on the nature of evidence as it is generally restricted to the gravitational aspects 
of the theory and hence it does not extend to the “General Theory” aspects. 

e There are limitations on the geometry of the spacetime since most evidence are related to the Schwarzschild 
metric in the vacuum region. 

e There are limitations on the endorsed (or rather supposedly-endorsed) aspects of the theory, e.g. time 
dilation but not length contraction. 

e There are limitations on the range of validity as exemplified by the strength of the gravitational field 
since most of the claimed evidence are related to the weak regime of gravitational field (Newtonian and 
quasi-Newtonian). 

e There are limitations on the formulation in some cases (such as the use of the linearized form of the 
Field Equation) and therefore any alleged evidence based on such formulations should be restricted by 
the limited (and potentially questionable) validity of these formulations. 


Overall, there are many features and aspects of the theory that are not validated (and even investigated) 
sufficiently and properly. Hence, the validity of the theory in its entirety and generality is not established 
even if we accept all the claimed evidence in support of general relativity and ignore all the claimed 
evidence and challenges against it (such as the violation of the strong equivalence principle and the 
inconsistencies and absurdities in its theoretical structure and applications). 


Exercises 


1. Discuss the issue of the limitation of the claimed evidence in support of general relativity to the 
gravitational aspects. 

2. What is the significance of the fact that most of the claimed evidence and tests in support of general 
relativity are related to the Schwarzschild metric in the vacuum region? 


10.1.20 Absurdities and Paradoxes 


We can find many inconsistencies and paradoxes in general relativity where most of these inconsistencies 
and paradoxes originate from the highly mathematized nature of the theory and its use of the geometric 
paradigm of spacetime curvature to model physical phenomena. For example, let analyzel?°°l the fall 
of an observer Oj, radially toward a black hole where he is observed by another observer O2 who stays 
stationary at a certain location in the space far away from the black hole. It is shown in the literature 
that the time required for such a journey (i.e. the fall to the singularity) is infinite according to O2 while 
it is finite according to O, (also see exercises 2 and 4 of § 7.12). Now, any time of any observer in the real 
world should be finite and hence the fall of O, to the singularity (and even reaching the event horizon) 
will never happen according to Og while it will certainly happen at some time according to O,. In fact, 
many direct and indirect physical consequences are based on the finity of the fall time, e.g. the increase 
of the black hole mass or angular momentum by the falling matter according to the outside observers and 
hence if the fall time is infinite according to these observers many of the claimed physical consequences 
related to the interaction of black holes with their surroundings will never be observed (and in fact will 
never happen). Many other nonsensical consequences and implications can be drawn from the general 
relativistic formulation and depiction of this journey, e.g. the swap of t and r and the decrease of t with 
the increase of 7 in the region r < Rg (noting that the coordinate time t represents the proper time 
at infinity). Absurdities and paradoxes can also be drawn from other general relativistic formulations, 
applications and solutions. Of course, general relativists are smart enough to find technical fixes to 
these absurdities and paradoxes and their alike but none of these fixes can overcome the impossibility of 


[300] We use in this analysis the Schwarzschild solution in the Schwarzschild coordinates. 
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envisaging this journey consistently and physically as depicted by the general relativistic formalism even if 
we can convince ourselves of its consistency and correctness from a formal and mathematical perspective. 
We note that the standard defence of general relativists (when they admit the nonsensical nature of these 
absurdities and paradoxes) is to blame the “inadequate” coordinates (e.g. Schwarzschild coordinates) for 
these absurdities and their alike. However, instead of blaming the coordinates for these absurdities, it is 
more appropriate to blame the theory that made the coordinates dysfunctional to such extent. 

It should be remarked that some authors seem to suggest that the infinity of the fall time from the 
perspective of the external observer in the above-described journey is apparent. If so, then this will 
put a question mark on the reality of some alleged general relativistic consequences and if the theory 
could lead to misleading results. In fact, this should create more problems than it is supposed to solve. 
We should also note that although some of the absurd implications are based on the use of Schwarzschild 
coordinates specifically (which are usually described as inappropriate or inadequate) the nonsensical nature 
of these implications reveals the limitations of the mathematical conclusions from the application of 
general relativity. In other words, if this nonsense can happen (with our awareness) in the Schwarzschild 
coordinates (i.e. in the formalism obtained by legitimate application of general relativity in association 
with the Schwarzschild coordinates) it could happen (most likely without our awareness) even in the 
non-Schwarzschild coordinates (and even in the non-Schwarzschild geometries) .!3°1] 


Exercises 


1. What criteria we use to assess a theory or a premise from the perspective of inconsistency and para- 
doxes? 

2. According to one version of the journey of a freely falling observer A toward a black hole as observed 
by another observer B at infinity: “B will never see A crossing the event horizon and the image of A 
will remain frozen at the event horizon for eternity”. Comment on this. 

3. Give some examples of general aspects in the theory of general relativity that are common sources of 
inconsistency and paradoxes. 

4. Give some examples of absurdities and paradoxes that originate from the mathematical formalism of 
general relativity. 

5. The blame for paradoxes and absurdities in the applications of general relativity is commonly attached 
to the inadequacy of the employed coordinate systems (e.g. Schwarzschild coordinates). How do you 
challenge this? 

6. Give some examples for the general relativistic absurdities from the physics of Schwarzschild black 
holes. 


10.1.21 The Paradox of Absolute Frame and Reality of Spacetime 


This, in our view, is one of the biggest epistemological paradoxes of general relativity. In fact, it is so 
big that no one seems to notice. In brief, the reality of the Newtonian space-time was denied in special 
relativity where the epistemological essence of this denial is represented by (or originates from) the denial 
of the absolute frame of spacetime (because the existence of absolute frame is a natural consequence of 
the reality of space-time). This may be understandable and acceptable within the framework of special 
relativity because the spacetime in special relativity is a bare skeleton that does not seem to embed 
or contain any physical essence or content (and hence the denial of its reality and thus the denial of the 
existence of absolute frame seems reasonable).!°°! Now, in general relativity (which inherits the spacetime 
of special relativity) we have a spacetime that is so real that it is curved by the presence of matter and 


[301] We should note that even in the mathematical basis of at least some of the “good coordinates” (e.g. Eddington- 
Finkelstein coordinates) we still have the problem of singularity at r = Rg in the transformation from the Schwarzschild 
coordinates to the “good coordinates” but the problem is ignored, and hence the solution is still not ideal from this 
perspective because the singularity is ignored (and thus it is still there) rather than dealt with deeply and fundamentally. 
In fact, if the coordinates are really just labels (as claimed by general relativity) then the nonsense that we obtained in 
the Schwarzschild coordinates should not be resolved by changing the coordinates. We should also note that nonsensical 
results (e.g. traveling back in time) are also obtained from other spacetime geometries like Kerr geometry. 

[302] Ty fact, even the spacetime in special relativity is not really a bare skeleton due to its possession of certain physical 
properties represented for example by “free space” constants like €9, to and c. 
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energy (as if it is a material object) and it even embeds real physical content since gravity (which in 
essence is a physical element) is part of the structure of this “very real” spacetime. In fact, the spacetime 
of general relativity is more real than even the space-time of the Newtonian theory (which was denied 
earlier by special relativity). Nevertheless, general relativity (or at least general relativists) continues to 
deny its absoluteness. 


10.1.22 Constancy, Invariance and Ultimacy of the Speed of Light 


One of the fundamental principles of the relativity theories is that the speed of light is a constant, invariant 
and ultimate speed for any physical motion (although the two theories differ in the global and local 
validity of this principle). However, there are many indications that this principle is not true in general 
since violations of this principle have been observed or deduced (at least seemingly and tentatively) from 
a number of experiments and observations. The most serious challenges to this principle seem to be 
from quantum entanglement and its consequences. If this principle collapsed, then the entire theoretical 
structure of the relativity theories will be under threat. The reader should be aware of the serious 
physical and epistemological consequences of any violation of the relativistic restrictions on the speed 
such as potential violation of causality and action at a distance within the relativistic framework. 


Exercises 


1. What makes the constancy of the speed of light in general relativity local and not global? 
2. Discuss the significance of quantum entanglement and its potential impact on the relativity theories. 


10.1.23 Lack of Practicality and Realism 


General relativity and its applications (notably relativistic cosmology) contain many non-practical aspects 
and this should affect their status as viable and realistic scientific theories (see § 1.9). For example, so- 
lutions to general relativistic problems are very difficult (and in most cases impossible) to obtain. Also, 
the duality of coordinate and metrical variables leads to serious practical difficulties some of which are 
outlined earlier (see for instance § 6.2 and § 10.1.6). There are also many examples of non-verifiable con- 
sequences (e.g. dark energy or certain cosmological and astrophysical occurrences and developments) and 
non-obtainable quantities (e.g. some proper cosmological quantities). The theory and its applications also 
contain or imply many non-realistic fantasies and illusions. In brief, general relativity and its applications 
are not very practical or realistic theories because of their complexity, over-mathematization and highly 
theoretical nature (as well as other reasons). 


10.1.24 Limited Usefulness 


Despite its alleged elaborate structure and aesthetic nature, the usefulness of general relativity as a 
“General Theory” is very limited. This is mainly due to its excessive generality and vagueness (as well as 
other potential reasons discussed earlier in the book and in the previous subsections in particular such as 
triviality of invariance, gaps and ambiguities and lack of practicality and realism). Hence, we can hardly 
find any practical use or application to this alleged “General Theory”. In fact, the actual usefulness of 
even the “gravity theory” is limited since almost all the practical applications of general relativity as a 
gravity theory can be obtained from classical gravity (and potentially from other theories as well). This 
issue will be investigated further later on. 


10.2 Overall Assessment 


In the following subsections we outline a number of general aspects in general relativity that can be 
used to assess the theory from practical and theoretical perspectives. We should note that these aspects 
represent a sample and not a comprehensive list. We should also note that many of the issues discussed 
and investigated in the subsections of the last section can be used for assessing general relativity. 
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10.2.1 Geometric Nature of General Relativity 


General relativity is a geometric theory of physical phenomena, and for this reason (as well as for other 
reasons) it is not intuitive. In fact, this is one reason for our belief that general relativity is not an 
ideal physical theory (neither scientifically nor epistemologically) because its geometric nature diminishes 
its capability to capture the “spirit” of the physical laws behind these phenomena even if we assume 
that the theory is “correct” in essence and can provide precise description and prediction of the physical 
phenomena. In other words, the theory may describe the phenomena by the symptoms rather than by 
the physical content of the phenomena. Moreover, the geometric nature of the theory makes it susceptible 
to mathematical artifacts and fantasies as a result of over-mathematization of what is supposed to be a 
physical theory. Also, unnecessary complications and hurdles will be introduced to physics (whose main 
objective is to understand the physical world and reduce its complexity by using relatively simple physical 
and mathematical patterns) thanks to this excessive mathematical approach and methodology. 


10.2.2 High Complexity 


General relativity is notorious for its complexity and sophistication due largely to its total dependency 
on very complex mathematical machinery (mainly differential geometry and tensor calculus and their 
sophisticated concepts and techniques) which makes the theory difficult to understand and puts limits on 
its usability (e.g. analytical solutions are rarely available and only in very special cases and numerical 
solutions are very difficult to obtain if they are accessible at all). In fact, this is another reason for general 
relativity to be non-ideal. Therefore, developing a simpler theory will be a big advantage to science. 
The possibility of developing an alternative theory originates from our belief that science is a mix of 
discovery and invention and hence any physical phenomenon can be formulated in many different ways 
all of which are correct.!°°%l In other words, science is not unique and hence alien civilizations should 
develop different versions of science to the version that we have. In fact, even in our earthly science we 
can find many examples of multiple correct formulations related to a single physical phenomenon where 
some formulations are superior or more advantageous in comparison to other formulations (although this 
is usually conditional and depends on the contexts and circumstances). It is worth noting that the non- 
unique nature of science (and indeed any form of knowledge) should not affect the principles of physical 
reality and truth (specifically the uniqueness of reality and truth) since these different versions represent 
different demonstrations of the same reality and various expressions of the unique truth. 


10.2.3. Highly Theoretical Nature 


The highly theoretical nature of general relativity (which is an obvious feature of this theory thanks 
to its reliance on a very abstract scientific and epistemological framework) makes physics vulnerable to 
detachment from reality and susceptible to fantasies and illusions such as wormholes and travel in time. 
In fact, general relativity contributed to the departure of modern physics from the realism and rationality 
which characterized classical physics over its long history prior to the appearance of general relativity and 
other similar theories of modern physics. The highly theoretical nature of general relativity also encourages 
the emergence of many hypothetical and conjectural adventures as well as virtually useless experimental 
and observational projects. These adventures and projects do not only consume huge amounts of precious 
resources into futile lavish enterprises of negligible return and waste huge amounts of money, effort and 
brain power which are badly needed in many theoretically and practically important areas of science, but 
they also divert science from its real route of progress and development and take it in the wrong direction 
far away from its real and beneficial objectives. 


10.2.4 Publicizing and Politicizing Science 


The historical route of development of modern physics has changed dramatically since the endorsement of 
general relativity by Eddington and his team in 1919. Although the theory itself cannot be blamed for such 


[303] We are implicitly assuming the correctness of general relativity although this is not necessarily the case. 
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historical developments and their subsequent consequences, they remain a decisive factor in determining 
the route and destiny of science and hence they should be seen as part of the legacy of the theory and its 
practical impact on science. Therefore, they should be taken into account when assessing the contribution 
of the theory to science and evaluating its potential impact (whether positive or negative) in the future. 

In brief, since the endorsement of Eddington and his team to general relativity, strange elements and 
factors entered into science, and physics became publicized and politicized. This is a result of the huge 
prestige that Einstein gained by this endorsement where he became a highly revered public figure and 
not just a scientist. Accordingly, the public fascination by Einstein and the huge support to his theories 
became a factor in determining the fate of science because scientists are humans influenced by the public 
opinion?°41 and hence they do not want to be deprived of public sympathy and support by opposing 
Einstein and his theories. This applies in particular to students and junior academics and researchers 
who through this “Einstein effect” are motivated or forced to support Einstein and his theories. The 
involvement of general media and propaganda machines have also contributed to this “Einstein effect” in 
determining the fate of science. Accordingly, even if Einstein theories (and general relativity in particular) 
are correct and ideal for the present state of science, this publicization and politicization of science will 
have negative impact on the progress of science in the long term because no one should believe that science 
has reached its ultimate destiny with Einstein and his theories and hence we should always look for the 
better and this cannot happen easily and naturally (as it should be) when we have an ultimate authority 
who very few have the will and courage to oppose and criticize. 

The “Einstein effect” also extends to the funding bodies, academic and research institutes, refereed 
journals and in fact to all the institutions of modern science with no exception. So, if someone wants to 
get a research grant or a faculty post or publish a paper in a respected journal or avoid public humiliation 
by being mocked and described (legitimately and aptly!) as foolish crackpot then he must follow the 
line of Einstein and his theories. In fact, the easiest way to commit scholarly suicide or assassinate the 
character of a scientist and terminate his career is to make him an enemy of Einstein and portray him as an 
opponent to his great theories due (allegedly) to personal grudge or envy or lack of decency or deficiency 
in intellect and mental power. All these forms of collective attitude and mobbing conduct contributed 
badly to modern science and made it very different from the old rational science that is restricted to the 
intellectual elites who have the mental power and legitimacy to deal with science and manage its thorny 
issues by legitimate scientific means. [°° 

In fact, these days even the illiterates and semi-literates enjoy chewing the “Einstein” word, with and 
without reason, to feel great and gifted by getting a link to “ingenuity” and gain public respect. Also, 
there is a distinctive feature in the modern scientific literature that is the personalization of science when 
it comes to the relativity theories where the tone changes abruptly to become a biography of Einstein and 
his achievements and personal traits more than a presentation of scientific theories and results. In fact, 
the scientific literature of the relativity theories (and even beyond) is full of this sort of bizarre manner of 
presenting science, most of which is based on personal emotions and tasteless exaggerations. So in brief, 
even if Einstein theories (and general relativity in particular) are correct and ideal from a theoretical 
perspective, with the involvement of these mobbing factors they become non-ideal for the progress of 
science in the long term because they make science like a religious order or cult or political party or 
ideological movement where emotions and prejudices play a major role in deciding what is right and what 
is wrong and determining the direction and destiny. 


10.2.5 Controversies, Conflicts and Uncertainties 


There are many aspects in the theory of general relativity that are not well determined and defined (see 
for example § 10.1.11) and hence they naturally become a source of controversies, conflicts and different 
interpretations and opinions which cast a dark shadow of uncertainty on the theory itself and its value 


[304] This is particularly true with respect to the “public opinion” in educational, academic and research institutes although 
these are not “public” in the general sense. 

[305] We should note that even in the old days of science there were some exceptions but they qualitatively and quantitatively 
differ from the miserable state of modern science. 
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and merit. In fact, a brief inspection of the literature (whether monographs or research and review papers 
or general articles) reveals that most aspects of general relativity are fertile fields for controversy and 
conflicting views and this should reflect an inherent haziness in the theory itself and its conceptual and 
logical structure. Although some of these controversial and obscure aspects belong to the interpretation 
they indirectly affect the formalism and how it is applied. This affects the value of the theory and casts 
doubts over its implications and alleged verifications because these verifications can be tailored according 
to certain interpretations to achieve the objective of endorsing the theory. Also, these gray areas of 
general relativity become a rich source of wild speculations, bizarre opinions and other sorts of fantasies 
and detachment from reality. 

In brief, although the mess and confusion in the literature of the relativity theories (i.e. including special 
relativity) can be partly attributed to the exceptional complexity of these theories (mainly the general) 
and their unusual nature, if we dig deep we will find that the principal reason behind the mess and 
confusion (which are linked to the controversies, conflicts and uncertainties) is actually the lack of clarity 
and definiteness in the theoretical framework of these theories where many hazy concepts, interpretations 
and formulations!®°°! are employed in the construction of these theories. We can also add to this the lack 
of rationality in many of the arguments and reasoning which these theories are based on since irrationality 
naturally leads to controversies, confusion and mess. 


10.2.6 Practical Value 


We have a number of observations and reservations about the practical value of general relativity; some 
of these are outlined in the following points: 

e As indicated earlier, general relativity has limited practical value due to its immense complexity and 
highly theoretical nature (noting that almost all the practical parts of the theory can be easily obtained 
from classical gravity). Analytical solutions (whether exact or not) are very few and are mostly related to 
simple or trivial cases whose results are already known from classical physics or observations and hence 
they are mainly for the purpose of checking and testing general relativity itself rather than producing 
novel results.!8°7l In fact, even numerical methods and other approximation techniques become highly 
complex and demanding when used with this theory. 

e In many cases general relativity needs the guidance of classical theory and this highlights a limitation 
of general relativity since it does not work independently. 

e The alleged practical results and benefits that are claimed to be obtained from general relativity such 
as the corrections to the global positioning system (GPS) can generally be obtained empirically or from 
classical gravity or from alternative theories and methods. 

e General relativity is not tested (at least sufficiently) in extreme gravitational systems (and hence it 
may mislead science in the future in the investigation of these systems due to the reliance on the alleged 
success of general relativity so far). 

e The incompatibility with quantum mechanics (plus the dogmatic persistence on keeping general relativity 
as the leading theory of gravity) hindered and will continue to hinder the progress of science especially in 
the field of finding a unified physical theory. 

e There is hardly any tangible benefit from general relativity as a “General Theory”. 

e When we compare general relativity with other theories of modern physics (such as quantum mechan- 
ics and Lorentz mechanics) we find that the actual yield of general relativity is trivial. For example, 
quantum mechanics revolutionized not only modern science and technology theoretically and practically 
but it revolutionized even our daily life and our philosophy and conceptions about the physical world 
and the epistemological framework of science. On the other hand, general relativity (with all its lavish 
requirements and high cost and the generous support that it enjoyed since the first years of its appear- 
ance) made very little practical impact. For instance, almost all the serious astronomical investigations 


[306] As seen, for example, in having several concepts for distance and having several interpretations for length contraction 
and having several formulae for Shapiro time delay. 

[307] This is in sharp contrast to other scientific theories like quantum mechanics whose results cannot be replicated or 
substituted by any other theory. 
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and space adventures (such as sending spaceships and satellites to the outer space) can be made (and 
are usually made) using the classical gravity theory with purely classical analytical tools and methods. 
This also applies to the theoretical value of general relativity!2°*! because almost all the claims on which 
general relativity is based are redundant conceptually and theoretically and can be replaced by their 
classical counterparts or alternative theories. In brief, general relativity is based on a set of unverified 
and unverifiable ideas and framework (e.g. gravity is a distortion in spacetime and hence it is not a force 
like other forces) that may revolutionize the thinking of whoever believes in these claims and chooses to 
revolutionize his thinking in this way but this is not necessarily the case because we can keep our classical 
conceptions and framework (or even embrace alternative theories) such as thinking of gravity in terms of 
gravitational field and hence it is a force like any other force. This is unlike quantum physics, for instance, 
where we cannot keep our classical conceptions and framework (or use alternative theory) when we deal 
with quantum systems. The reality is that most of the significance of general relativity and its “huge” 
contribution to modern science are the result of the huge propaganda machine behind general relativity 
and the generous support that it enjoyed from the beginning. We strongly believe that if the classical 
theory or some of the alternative gravity theories are given the chance to evolve and are granted a tiny 
fraction of the generous and prejudiced support that is given to general relativity then they will make 
at least as much contribution as the alleged contribution of general relativity.°9! This is due in part to 
the huge complexity of this theory and its many problematic aspects (practical and theoretical) which 
diminish its actual yield despite the huge amount of resources that it consumes. 


Exercises 


1. Argue in favor of the redundancy of general relativity practically. 


10.2.7 Theoretical Value 


We also have a number of observations and reservations about the theoretical value of general relativity; 
some of these are outlined in the following points: 

e General relativity is not intuitive or easy-to-understand at all. Interpreting gravity as distortion in 
spacetime may be mathematically elegant and appealing but it is far from being obvious or natural 
although it may seem so thanks to the systematic indoctrination which starts from the first years of 
science education. The notorious complexity and the lack of intuitivity of general relativity should be 
considered a negative factor rather than a positive factor as it might be imagined and claimed (see 
Exercises). 

e Apparently, the most important contributions of general relativity are in the fields of cosmology and 
extreme astrophysical systems. However, cosmology is not a rigorous science since it is based in large 
part on philosophical contemplations and assumptions that degrade its status as a science. The reality is 
that cosmology (even in its modern version) is a philosophical discipline in essence rather than a physical 
science. Its scientific appearance is largely attributed to the involvement of general relativity and the 
consequent mathematization of this subject. Most of the claimed “facts” and results of cosmology are 
not verified by experiment or observation while many others are not verifiable at all because they are 
beyond the reach of science. Hence, cosmology is not a physical science in a strict sense and therefore the 
alleged achievements of general relativity in the field of cosmological theories and investigations (assuming 
that all these alleged achievements are as claimed) are not really a contribution of general relativity to 
science, although they contain valuable contributions to applied mathematics and philosophy as well as 
legitimate speculations and contemplations about the origin and destiny of the Universe. The situation of 
the general relativistic investigations in the field of extreme astrophysical systems (such as black holes) is 


[308] In fact, the theoretical value of general relativity is discusses in § 10.2.7 and hence this part belongs more aptly to that 
subsection. However, we discuss it here to complete the comparison to quantum mechanics while avoiding discontinuity 
and interruption. 

[309] Tt is noteworthy that the trivial yield of general relativity and its handicaps led to its decline in the 1940s and 1950s 
where it was dying but it was revitalized by orchestrated promotional campaigns from relativity-loving physicists like 
Wheeler. So, general relativity was “created” by the propaganda of Eddington and his alike and resurrected by the 
propaganda of Wheeler and his alike (noting that this propaganda is still going on) and hence general relativity did not 
survive and thrive naturally and by its own merit and strength. 
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very similar since these investigations are highly mathematical and speculative in nature. Moreover, the 
claimed results are mostly not verified at all (noting that even when they are “verified” the verification is 
generally insufficient). Hence, in most cases they cannot be regarded as authentic scientific achievements 
of the theory. 

e We also refer the reader to the last point in § 10.2.6 where we briefly talked about the theoretical value 
of general relativity in the context of comparison to quantum physics. 


Exercises 


1. What is your overall assessment of general relativity? 


2. What are the advantages of having intuitive and easy-to-understand scientific theory? 
3. Is general relativity ideal? 
4. Is general relativity correct? 


Epilogue 


e It may be difficult to reach a firm verdict about the validity of general relativity and if it is right or 
wrong, but we can say with certainty that general relativity is not an ideal theory. Hence, the search for 
a better alternative theory (both as a gravity theory and as a “General Theory”) should be a priority for 
modern physics. The dogmatic embracement of this theory and accepting it as the perfect theory does 
not serve the progress of science. 

e No one can claim that classical physics is a perfect and ultimate theory, but at least it is based on 
logical methodology and rational epistemology whereas modern physics is based in large part on illogical 
methodology and irrational epistemology and the main credit for this trend belongs to theories like general 
relativity. Thanks to the school of thought and methodology of the relativity theory and its alike we 
have things like wormholes, white holes, time machines, dark energy, and so on. In fact, this school 
of thought infested modern physics with distorted logic, metaphysical objects, creation theories, heavily 
mathematized physical models that are completely detached from physical reality, use of observations as 
supplementary tools to justify ready-made theories, ..., and the list extends to include many other crazes 
and heresies. Thanks to this school of thought we now read in our physics texts things like this (quoted 
with minor modifications): 

“At t = 10~*8 s, the universe was extremely dense, small and hot. The universe spanned a region of 107°? 
cm. Matter and energy were indistinguishable. The temperature of the universe was 10°” K. In a tiny 
fraction of a second, the universe expanded rapidly and doubled in size many times ... etc.”. 

This sort of nonsense is taught as solid science and the young minds are indoctrinated and brainwashed 
to believe that this is the true story of the Universe and its alleged creation. This sort of illusion forms 
the framework for huge amounts of academic work and research projects in the universities and scientific 
institutes around the world. In brief, this school of thought spoiled the beauty, rationality and realism of 
physics and reduced it to a fertile ground for hallucination, wild speculations and bizarre ideas. 

e General relativity diverted modern physics far away from its natural and healthy route of development 
thanks to its unusual nature and methodology. What worsens the situation is that the undisciplined 
nature of this methodology encourages many incompetent or ambitious scientists to release their wild 
imagination in fancy scientific enterprises and ideas. Huge resources are spent for example on testing 
very trivial implications and consequences (or alleged implications and consequences) of this theory and 
this resulted in wasting huge amounts of precious resources in futile or trivial investigations instead of 
spending these resources in objective and useful investigations. Although we may be witnessing huge 
progress in modern science, modern science is highly inefficient since it consumes huge resources.819l In 
other words, the scientific process can be highly optimized and the spent resources can be substantially 
reduced for the same amount of real progress. 

e Although general relativity is commonly depicted in the modern literature as the perfect theory 
and described as proven (or well established), it is far from being perfect or proven. It should be more 
appropriate to call it the imperfect theory. Only with bias, propaganda and brainwashing it can become 
the perfect theory. Regardless of its formalism and if it is right or wrong, the logical and epistemological 
foundations of general relativity can be questioned in many details. However, this is generally inhibited 
by the curtain of exaggeration that engulfs this theory as well as the complexity of its formal structure. 
In fact, there are many dark corners in general relativity (both in formalism and epistemology) that have 


[311] 


[310] This should be compared to classical science which is mostly developed by individuals with very little resources. The 
great achievements of classical science (despite its limitations) cannot be denied or disputed. In fact, these achievements 
(considering the limited spent resources) are much greater than the achievements of modern science. 

[311] Interestingly, while the early physicists who witnessed the birth and early development of general relativity (and some 
have worked closely with Einstein) express their awareness of the defects and limitations of general relativity (as 
exemplified by the upcoming quote from Bergmann), the more recent physicists do not spare any opportunity to make 
absurd exaggerations in praising general relativity such as describing it the perfect theory or the most beautiful theory 
ever created. 

Quote from Bergmann (see References): “The third part deals with several attempts to overcome defects in the general 
theory of relativity. None of these theories has been completely satisfactory”. 


234 


not been investigated properly because modern physicists are more concerned about establishing the glory 
of Einstein and his theories than about establishing a reliable scientific theory, and hence amid the frenzy 
of this mystic devotion the truth is lost. 

e Finally, the “eccentric” views that we expressed above (and throughout the book) may be challenged 
by the “huge” developments (particularly in astrophysical and cosmological studies) which are based on 
general relativity. However, we believe that most (if not all) of these developments are motivated and 
staged by general relativity in the sense that they are made with general relativity being in mind as the 
only viable theory. Any other alternative theory will make similar contributions within its framework and 
hence the general relativistic view that dominated and steered these investigations and developments is not 
a necessity. Yes, if these studies were based on objective observations and contemplations without a prior 
presumption and bias in favor of a certain theory, and general relativity was the only viable framework that 
can explain these observations (as it is the case for example in the development of quantum mechanics) 
then the credit for these developments can be given to general relativity. In brief, with the huge bias in 
favor of general relativity many of these developments are no more than re-production (or echo) of general 
relativity and hence they are not genuine scientific advances made by general relativity.8!7] Moreover, 
we question modern cosmology as a credible, rigorous and precise science because it is largely based 
on theoretical speculations and mathematical models with very little observational and experimental 
substance or evidence. In fact, large part of modern cosmology is based on philosophical contemplations 
and assumptions as well as many unverified and even unverifiable premises. For example, large part of 
the theoretical investigations that are based on or related to general relativity, such as dark energy and 
wormholes, belong to metaphysics and hence they cannot be considered contributions of general relativity 
to science. In fact, general relativity sometimes plays a destructive role by re-directing huge resources to 
such futile investigations and taking science away from its natural route and objectives. 

e Although we should do our best to understand the Universe and the laws that govern its behavior, there 
is no guarantee that we can reach this goal. The Universe may be too big and too complex to be fully 
understood or even understandable. Our physical limitations may impose boundary conditions on our 
ability to understand. This view seems to be very realistic from a scientific perspective although this should 
not impose any limit on the legitimacy of our philosophical and spiritual speculations, contemplations and 
experiences. It seems to me that large part of modern physics (especially in cosmology) belongs to this 
category of intellectual activities which are wrongly classified as science, and from this perspective general 
relativity may be a suitable theory for this kind of activities. Nevertheless, because of the many negative 
aspects of general relativity (e.g. theoretical and practical complexity, over-mathematization, lack of 
clarity and definiteness in many aspects, looseness and sloppiness in some formulations, epistemological 
defects, controversies, impracticality, etc.) and for the sake of the progress of real science, physicists 
should look for a better theory to replace this theory both as a gravity theory and as a “General Theory”. 


[312] Th fact, most parts of astrophysics (and even cosmology) can be explained by classical Newtonian gravity even without 
any extension or modification. Moreover, most of the alleged necessities of employing general relativity in some details 
of these subjects are arguable and can be challenged. 
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Eot-Wash experiments, 37 
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Escape speed, 8, 111, 176 
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Extra precession, 8, 157-159, 161-163, 188 
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order approximation, 98, 123 
order deviation, 98 
order perturbation, 122 
order variation, 114 
postulate (of special relativity), 34 
Flat 
geometry, 57, 97 
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space, 8, 16, 143, 144, 220, 227 
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209, 210 
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curvature, 8, 68, 70, 71 
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unit vector, 7, 70 
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inertial frame, 141, 207 
positioning system (GPS), 7, 192, 195-197, 219, 231 
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Gravitated, 21, 141, 220, 222 
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inhabitant, 68, 76 
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Irreversibility, 110 
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first law, 24, 26 
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Kerr 
black hole, 180, 181, 184 
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PSR B1913+16 binary pulsar system, 202 
Pulsar, 188, 202, 210 
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